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The Golden Guess 
Is Morning-Star to the full round of Truth. 

— Tennyson. 


Preface 

This text represents an extensive expansion and revision, both in scope and 
content, of the earlier work entitled “Military Cryptanalysis, Part I” by William 
F. Friedman. This expansion and revision was necessitated by the considerable 
advancement made in the art since the publication of the previous text. 

I wish to express grateful acknowledgment for Mr. Friedman’s generous as¬ 
sistance and invaluable collaboration in the preparation of this edition. I also 
extend particular appreciation to my colleague Robert E. Cefail for his numerous 
valuable comments and assistance in writing the new material which is contain¬ 
ed herein. 


- L. D. C . 
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r . ^.SECTION I 
'jSfifebDUCTORY* REMARKS ' 

- ^r~r v* . . 'J- 

- Paragraph 


Scope of this text............. X 

Mental equipment necessary for crypt analytic work. 2 

Validity of results of cryptanalysis. 3 


1. Scope of this tex t.— s_ This text constitutes the first of a 

series of six basic texts~on the art of cryptanalysis . Although most of 
the information contained herein is applicable to cryptograms of various 
types and sources, special emphasis will be laid upon the principles and 
methods of solving military 1 2 cryptograms. Except for an introductory 
discussion of fundamental principles underlying the science of cryptana¬ 
lytics , this first text in the series will deal solely with the principles 
and methods for the analysis of monoalphabetic substitution ciphers . 

Even with this limitation it will be possible to discuss only a few of the 
many variations of this one type that are met in practice; but with a 
firm grasp upon the general principles few difficulties should be expe¬ 
rienced with any modifications or variations that may be encountered. 

4. to* _ to 

b. This and some of the succeeding texts will deal only with basic 
types of cryptosystems not because they may be encountered unmodified in 
military operations but because their study is essential to an under¬ 
standing of the principles underlying the solution of the modern, very 
much more complex types of codes, ciphers, and certain encrypted trans¬ 
mission systems that are likely to be employed by the larger governments 
of today in the conduct of their military affairs in time of war. 

£. It is presupposed that the student has no prior background in 
the field of cryptology; therefore cryptography is presented concurrently 
with cryptanalysis. Basic terminology and preliminary cryptologic consi¬ 
derations are treated in Section II; other terms are usually defined upon 
their first occurrence, or they may be found in the Glossary (Appendix l). 

d. The cryptograms presented in the examples embrace messages from 
hypothetical air, ground, and naval traffic; thus, the student will have 
the opportunity to familiarize himself with the language and phraseology 
of all three Services comprising the Armed Forces of the United States. 


1 Each text has its accompanying course in cryptanalysis, so that the 
student may test his learning and develop his skill in the solution of 
the types of cryptograms treated in the respective texts. The problems 
which pertain to this text, constitute Appendix 13. 

2 The word "military" is here used in its broadest sense. In this 
connection see subpar. d, below. 
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2. Mental equipment necessary for cryptanalytic work . —-a. Captain 
Parker Hitt, in the first United States Army manual^ dealing with cryp¬ 
tology, opens the first chapter of his valuable treatise with the 
following sentence: 

"Success in dealing with unknown ciphers is measured by these four 
things in the order named: perseverance, careful methods of analysis, 
intuition, luck." 

These words arc as true today as they were then. There is no royal 
road to success in the solution of cryptograms. Hitt goes on to say: 

"Cipher work wi31 have little permanent attraction for one who 
expects results at once, without labor, for there is a vast auount of 
purely routine labor in the preparation of frequency tables, the rearrange' 
ment of ciphers for examination, and the trial and fitting of letter to 
letter before the message begins to appear.” 

The present author deems it advisable to add that the kind of work 
involved in solving cryptograms is not at all similar to that involved in 
solving crossword puzzles, for example. The wide vogue the latter have 
had and continue to have is due to the appeal they make to the quite 
common interest in mysteries of one sort or another; but in solving a 
crossword puzzle there is usually no necessity for performing any preli¬ 
minary labor, and palpable results become evident after the first minute 
or two of attention. This successful start spurs the crossword "addict” 
on to complete the solution, which rarely requires more than an hour’s 
time. Furthermore, crossword puzzles are all alike in basic principles 
and once understood, there is no more to learn. Skill comes largely from 
the embellishment of one's vocabulary, though, to be sure, constant prac¬ 
tice and exercise of the imagination contribute to the ease and rapidity 
with vhich solutions are generally reached. In solving cryptograms, 
however, many principles must be learned, for there are many different 
systems of varying degrees of complexity. Even some of the simpler 
varieties require the preparation of tabulations of one sort or another, 
which many people find irksome; moreover, it is only toward the very close 
of the solution that results in the form of intelligible text become 
evident. Often, indeed, the student will not even know whether he is on 
the right "raci- until he has performed a large amount of preliminary 
"spade work." involving many hours of labor. Thus, without at least a 
willingness to pursue a fair amount of theoretical study, and a more than 
average amount of patience aud perseverance , little skill and experience 
can be gained in the rather difficult art of cryptanalysis. General 
Givierge, the author of an excellent treatise on cryptanalysis, remarks 
in this connection:^ 

"The cryptanalyst’s attitude must be that of William the Silent: 

No need to hope in order to undertake, nor to succeed in order to 
persevere." 


3 Hitt, Capt. Parker, Manual for the Solution of Military Ciphers . 

Army Service Schools Press. Fort Leavenworth, Kansas, 1916 . 2 d Edition,- 
1918. (Both out oC print.) 

^ Givierge, General Marcel, Cours de Cryptographic, Paris, 1925 , p. 301. 
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b. As regards Hitt's reference to careful methods of analysis, 

"before one can "be said to "be a cryptanalyst worthy of the name it is 
necessary that one should have, firstly, a sound knowledge of the "basic 
principles of cryptanalysis, and secondly, a long, varied, and active 
practical experience in the successful application of those principles. 

It is not sufficient to have read treatises on this subject. One month's 
actual practice in solution is worth a whole year's mere reading of 
theoretical principles. An exceedingly important element of success in 
solving the more Intricate cryptosystems is the possession of the rather 
unusual mental faculty designated in general terms as the power of in¬ 
ductive and deductive reasoning. Probably this is an Inherited father 
than an acquired faculty; the "best sort of training for its emergence, 

if latent in the individual, and for its development is the study of 
the natural sciences, such as chemistry, physics, biology, geology, and 
the like. Other sciences such as linguistics, archaeology, and philology 
are also excellent. 

c. Aptitude in mathematics is quite important, more especially in 
the solution of ciphers and enciphered codes than in codebook reconstruc¬ 
tion, which latter is purely and simply a linguistic problem. Although 
in the early days of the emergence of the science of cryptanalytics little 
thought was given to the applications of mathematics in this field, many 
branches of mathematics and, in particular, probability and statistics, 
have now found cryptologic applications. Those portions of mathematics 
and those mathematical methods which have cryptologic applications5 are 
known collectively as cryptomathematics . 


5 It is quite important to stress at this point that in professional 
cryptologic work the science of’cryptanalytics is subordinated to the 
art of cryptanalysis, just as in ,the world of music the technical virtu¬ 
osity of a great violinist is adjuvant to the expression of music, that 
is, the virtuosity is a "tool” for the recovery of the complete musical 
"plain text" conceived by the composer. Since the practice of cryptana¬ 
lysis is an art, mathematical approaches cannot always be expected to 
yield a solution in cryptology, because art can and must transcend the 
cold logic of scientific method. By way of example, an experienced 
Indian guide can usually find his way out of a dense forest more readily 
than a surveyor equipped with all the refined apparatus and techniques 
of his profession. Likewise, an experienced cryptanalyst can generally 
find his way through a cryptosystem more readily than a pure mathematician 
equipped merely with the techniques of his field no matter how abstruse 
or refined they may be. A cryptomathematician of repute once stated 
that "the only effect of ^/refined mathematical technique^ is frequently 
to discourage one so much that one does nothing at all and some unmathe- 
matical ignoramus then gets the problem out in some very unethical way. 
This is intensely irritating»" See also in this connection the remarks 
made in subpar. 27 e in reference to the validity of statistical tests 
in cryptanalysis. ” " ’ 
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d. An active imagination, or perhaps what Hitt and other writers 
call intuition , is essential, hut mere imagination uncontrolled hy a 
judicious spirit will he more often a hindrance than'a help. In prac¬ 
tical cryptanalysis the imaginative or intuitive faculties must, in other 
words, he guided hy good judgment, hy practical experience, and hy as 
thorough a knowledge of the general situation or extraneous circumstances 
that led to the sending of the cryptogram as is possible to obtain. In 
this respect the many cryptograms exchanged between correspondents whose 
identities and general affairs, commercial, social, or political, are 
known are far more readily solved® than are isolated cryptograms exchanged 
between unknown correspondents, dealing with unknown subjects. It is 
obvious tli n t in the former case there are good data upon which the intu¬ 
itive povej t of th® cryptanalyst can be brought to bear, whereas In the 
latter cas * no such d^fca are available. Consequently, in the absence of 
s”ch data, no matter how good the imagination and intuition of the cryp¬ 
tanalyst, these powers are of no particular service to him. Some writers, 
however, regard the intuitive spirit as valuable from still another 
viewpoint, as may be noted in the following 

"Intuition, like a flash of lightning, lasts only for a second. 

It generally comes when one is tormented by a difficult decipherment and 
when one reviews in his mind the fruitless experiments already tried. 
Suddenly the light breaks through and one finds after a few minutes what 
previous days of labor were unable to reveal." 

This, too, is true, but unfortunately there is no way in which the 
intuition may be summoned at will, when it is most needed.® There are 
certain authors who regard as indispensable the possession of a somewhat 

~ 6 The application In practical, operational cryptanalysis of "probable 

words" or "cribs", i.e., plain text assumed or known to be present in a 
cryptogram, is developed in time of war Into a refinement the extent and 
usefulness of which cannot be appreciated by the uninitiated. Even as 
great a thinker as Voltaire found the subject of cryptanalysis stretching 
his credulity, to the point that he said: 

"Those who boast that they can decipher a letter without knowing its 
subject matter, and without preliminary aid, are greater charlatans than 
those who would boast of understanding a language which they have never 
learned."— Dictionnaire Philosophique , under the article "Poste". 

7 Lange et Sou&art, Traite de Cryptographie , Libraire Felix Alcan, 
Paris, 1925, p. 10k. 

8 The following extracts are of interest in this connection: 

“The fact that the scientific investigator works 50 per cent of his time by non-rafcional means is, it seems, quite 
insufficiently recognized. There is without the least doubt an instinct for research, and often the most successful 
investigators of nature are quite unaMc to give an account of their reasons for doing such and such an experi¬ 
ment, or for placing side by side two apparently unrelated facts Again, one Of the most salient traits ip the 
chaiacter of the successful scientific worker is the capacity for knowing that a point is proved when it would not 
appeal to be proved to an outside intelligence functioning in a purely rational manner; thus the investigator 
feels that some pi oposition is true, and proceeds at once to the next set of experiments without waiting and wasting 
time in the elaboration of the founnl proof of the point which heavier minds w'ould need. Questionless such a 
scientific intuition may and does sometimes lead investigators astray, but it is quite certain that if they did 
not widely make use of it, they would not get a quarter as far as they do. Expeiiments confirm each other, and a 
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rare, rather mysterious faculty that they designate by the word "flair”, 
or by the expression "cipher brains". Even so excellent an authority as 
General Glvierge, 9 in referring to this mental faculty, uses the fol¬ 
lowing words: 

"Over and above perseverance and this aptitude of mind which some 
authors consider a special gift, and which they call intuition, or even, 
in its highest manifestation, clairvoyance, cryptographic studies will 
continue more and more to demand the qualities of orderliness and memory." 

Although the present author believes a special aptitude for the 
work is essential to cryptanalytic success, he is sure there is nothing 
mysterious about the matter at all. Special aptitude is prerequisite to 
success in all fields of endeavor. There are, for example, thousands 
of physicists, hundreds of excellent ones, but only a handful of world¬ 
wide fame. Should it be said, then, that a physicist who has achieved 
very notable success in his field has done so because he is the fortunate 
possessor of a mysterious facility? That he is fortunate in possessing a 
special aptitude for his subject is granted, but that there is anything 
mysterious about it, partaking of the nature of clairvoyance (if, indeed, 
the latter is a reality ) is not granted. While the ultimate nature of 
any mental process seems to be as complete a mystery today as it has 
ever been, the present author would like to see the superficial veil of 
mystery removed from a subject v that has been shrouded in mystery from 
even before the Middle Ages down to our own times. (The principal and 
readily understandable reasoh”for this is that governments have always 
closely guarded cryptographic secrets and anything so guarded soon becomes 
"mysterious".) He would, rather, have the student approach the subject 
as he might approach any other science that can stand on its own merits 
with other sciences, because ;8?jrotanalytics, like other sciences, has a 
practical importance in human™rfairs. It presents to the inquiring mind 
an interest in its own right as a branch of knowledge; it, too, holds 
forth many difficulties and disappointments, and these are all the more 


false step is usually soon discovered. And 110 J only by this partial replacement of reason by intuition does the 
work of science go on, but also to the born scientific worker—and emphatically they cannot be made—the struc¬ 
ture of the method of research is as it were given^ he cannot explain it to you, though he may be brought to agree 
a posteriori to a formal logical presentation of the way the method works”.—Excerpt from Needham, Joseph, 
The Sceptical Biologist, London, 1929, p. 79. 

“The essence of scientific method, quite" simply, is to try to see how data arrange themselves into causal 
configurations. Scientific problems are soh ed by collecting data and by “thinking about them all the time.” 
We need to look at strange things until, by the appearance of known configurations, they seem familiar, and to 
look at familiar things until we see novel configurations which make them appear strange. We must look at 
events until they become luminous. That U scientific method . . . Insight is the touchstone . . . The appli¬ 
cation of insight as the touchstone of method gables us to evaluate properly the role of imagination in scientific 
method. The scientific process is akin to the artistic process: it is a process of selecting out those elements of 
experience which fit together and recombining them in the mind. Much of this kind of research is simply a cease¬ 
less mulling over, and even the physical scientist has considerable need of an armchair . . . Our view of scien¬ 
tific method as a struggle to obtain insight forces the admission that science is half art . . . Insight is the 
unknown quantity which has eluded students of scientific method”.—Excerpts from an article entitled Insight and 
Scientific Method , by Willard Waller, in The American Journal of Sociology , Yol. XL, 1934* 

9 Op. cit ., p. 302. 
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keenly felt when the nature of these difficulties is not understood hy 
those unfamiliar with the special circumstances that yery often are the 
real factors that led to success in other cases. Finally, just as in 
the other sciences wherein men labor long and earnestly for the true 
satisfaction and pleasure that comes from work well done, so the mental 
pleasure that the successful cryptanalyst derives from his accomplish¬ 
ments is very often the only reward for much of the drudgery that he must 
do in his daily work. General Givlerge's words in this connection are 
well worth quoting 

"Some studies will last for years before bearing fruit. In the case 
of others, cryptanalysts undertaking them ne-ver get any result. But, for 
a cryptanalyst who likes the work, the Joy of discoveries effaces the 
memory of his hours of doubt and impatience." 

e. With his usual deft touch, Hitt says of the element of luck, as 
regards the role it plays in analysis: 

"As to luck, there is the old miners' proverb: 'Gold is where you 
find it.'" 

The cryptanalyst is lucky when one of the correspondents whose cryp¬ 
tograms he is studying makes a blunder that gives the necessary clue; or 
when he finds two cryptograms identical in text but in different keys in 
the same system; or when he finds two cryptograms identical in text but 
in different systems, and so on. The element of luck is there, to be 
sure, but the cryptanalyst must be on the alert if he is to profit by 
these lucky "breaks". 

f. If the present author were asked to state, in view of the pro- 1 
gress in the field since 1916, what elements might be added to the four 
ingredients Hitt thought essential to cryptanalytic success, he would 

be inclined to mention the following: 

(1) A broad, general education, embodying interests covering as 
many fields of practical knowledge as possible. This is useful because 
the cryptanalyst is often called upon to solve messages dealing with 
the most varied of human activities, and the more he knows about these 
activities, the easier his task. 

(2) Access' to a large library of current literature, and wide and 
direct contacts with sources of collateral information. These often 
afford clues as-to the contents of specific messages. For example,, to 
be able instantly to have at his disposal a newspaper report or a 
personal report of events described or referred to in a message Tinder 
investigation goes a long way toward simplifying or facilitating 
solution. Government cryptanalysts are sometimes fortunately situated 
in this respect, especially where various agencies work in harmony. 

(3) Proper coordination of effort.. This includes the organization 

of cryptanalytic personnel into harmonious, efficient teams of cooperating 
individuals. , . 


10 Qp» clt ., p. 301* 
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(*1) Under mental equipment he would also include the faculty of 
heing able to concoutrato on a problem for rather long periods of time, 
without distraction, nervous irritability, and impatience. The strain 
under vhich cryptanalytic studies are necessarily conducted is quite 
severe and too long-continued application has the effect of draining 
nervous energy to an unwholesome degree, so that a word or two of caution 
may not here be out of place. 4 One should continue at work only so long 
as a peaceful, calm spirit preva'ils, whether the work is fruitful or not. 
But just as soon as the mind becomes wearied with the exertion, or just 
as soon as a feeling of hopelessness or mental fatigue intervenes, it is 
better to stop completely and turn to other activities, rest, or play. 

It is essential to remark that systematization and orderliness of work 
, are aids in reducing nervous tension and irritability. On this account 
it is better to take the time to prepare the data carefully, rewrite the 
text if necessary, and so on, rather than work with slipshod, incomplete, 
or improperly arranged material. 

(5) A retentive memory 1 b an important asset to cryptanalytic skill, 
especially in the solution of codes. The ability to remember individual 
groups, their approximate locations in other messages, the associations 
they form with other groups, their peculiarities and similarities, saves 
much wear and tear of the mental machinery, as well as much time in 
looking up these groups in indexes.' 

(6) The assistance of machine aids in cryptanalysis. The importance 
and value of these aids cannot be overemphasized in their bearing on prac¬ 
tical, operational cryptanalysis, especially in the large-scale effort 
that would be made in time of war on complex, high-grade cryptosystems at 
a theater headquarters or in the zone of the interior. These aids, under 
the general category of rapid, analytical machines, comprise both punched- 
card tabulating machinery and certain other general- and special-purpose 
high-speed electrical and electronic devices. Some of the more compact 
equipment may be employed by lower echelons within a theater of operations 
to facilitate the cryptanalysis - of medium-grade cryptosystems found in 
tactical communications. 


g. It may be advisable to add a word or two at this point to prepare 
the student to expect slight mental jars and tensions which will almost 
inevitably come to him in the conscientious study of this and the sub¬ 
sequent texts. The present author is well aware of the complaint of 
students that authors of texts on cryptanalysis base much of their expla¬ 
nation upon their foreknowledge of the "answer”—which the student does 
not know while he is attempting to follow the solution with an unbiased 
mind. They complain, too, that these authors use such expressions as "lb 
Is obvious that", "naturally 1 *, "of course", "it is evident that", and 
so on, when the circumstances seem not at all to warrant their use. 

There is no question that this sort of treatment is apt to discourage 
the student, especially when the point elucidated becomes clear to him 
only after many hours’ labor, whereas, according to the book, the author 
noted the weak spot at the first moment’s inspection. The present 
author can only promise to try to avoid making the steps appear to be 
much more simple than they really are, and to suppress glaring instances 
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of unjustifiable "jumping at conclusions". At the same time he must 
indicate that for pedagogical reasons in many cases a message has been 
consciously "manipulated" so as to allow certain principles to become 
more obvious in the illustrative examples than they ever are in practical 
work. During the course of some of the explanations attention will even 
be directed to cases of unjustified inferences. Furthermore, of the 
student who is quick in observation and deduction, the author will only 
ask that he bear in mind that if the elucidation of certain principles 
seems prolix and occupies more space than necessary, this is occasioned 
by the author's desire to carry the explanation forward in very short, 
easily-comprehended, and plainly-described steps, for the benefit of 
students who are perhaps a bit slower to grasp but who, once they under¬ 
stand, are able to retain and apply principles slowly learned just as 
well, if nob better than the students who learn more quickly 

3. Validity of results of cryptanalysis .—Valid or authentic 
cryptanalytic solutions cannot and do not represent "opinions" of the 
cryptanalyst. They are valid only so far as they are wholly objective, 
and are susceptible of demonstration and proof, employing authentic, 
objective methods. It should hardly be necessary (but an attitude fre¬ 
quently encountered among laymen makes it advisable) to indicate that 
the validity of the results achieved by any serious cryptanalytic studies 
on authentic material rests upon the same sure foundations and are 
reached by the Bame general steps as the results achieved by any other 
scientific studies; viz., observation, hypothesis, deduction and induction, 
and confirmatory experiment. Implied in the latter is the possibility 
that two or more qualified investigators, each working independently 
upon the some material, will achieve identical (or practically identical) 
results—there is one and only one (valid) solution to a cryptogram. 
Occasionally a "would-be" or pseudo-cryptanalyst offers "solutions" which 
cannot withstand such tests; a second, unbiased, investigator working 
independently either cannot consistently apply the methods alleged to 
have been applied by the pseudo-cryptanalyst, or else,"if he can apply 


H In connection with the use of the word "obvious", the following 
extract is of interest; 

"How the word 'obvious' is a rather dangerous one. There is an 
incident, which has become something of a legend in mathematical circles, 
that illustrates this danger. A certain famous mathematician was lec¬ 
turing to a group of students and had occasion to vise a formula which he 
wrote down with the remark, 'This statement is obvious.' Then he paused 
and looked rather hesitantly at the formula. 'Wait a moment,' he said. 

'Is it obvious? I think it's obvious.' More hesitation, and then, 

'Pardon me, gentlemen, I shall return.' Then he left the room. Thirby- 
f ive minul.ee later he returned; in his hands was a sheaf of papers 
covered with calculations, on his face a look of quiet satisfaction. 

'I was righb, gentlemen. It is obvious,' he said, and proceeded with his 
lecture."--Excerpt from The Anatomy of Mathematics by Kershner and Wilcox. 
Hew York, 1950. 
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them at all, the results (plaintext translations) are far different In 
the two cases. The reason for this is that in such cases it is generally 
found that the "methods" are not clear-cut, straightforward or mathema¬ 
tical in character. Instead, they often involve the making of Judgments 
on matters too tenuous to measure, weigh, or otherwise subject to careful 
scrutiny. Often, too, they involve the "correction" of an inordinate 
number of "errors" which the pseudo-cryptanalyst assumes to he present and 
which he "corrects" in order to make his "solution" intelligible. And 
sometimes the pseudo-cryptanalyst offers as a "solution" plain text which 
is intelligible only to him or which he makes intelligible by expanding 
what he alleges to be abbreviations, and so on. In all such cases, the 
conclusion to which the unprejudiced observer is forced to come is that 
the alleged "solution" obtained by the pseudo-cryptanalyst is purely 
subjective.12 In nearly all cases where this has happened (and they occur 
from time to time) there has been uncovered nothing which can in any way 


12 a mathematician is often unable to grasp the concept behind the ex¬ 
pression "subjective solution" as used in the cryptanalytic field, since 
the idea is foreign to the basic philosophy of mathematics and thus the 
expression appears to him to represent a contradiction in terms. As an 
illustration, let us consider a' situation in which a would-be cryptanalyst 
offers a solution to a cryptogram he alleges to be a simple monoalphabetic 
substitution cipher. His so-called solution, however, requires that he 
assume the presence of, let us say,‘ approximately 50$ garbles (which he 
claims to have been introduced by cipher clerks* errors, faulty radio 
reception because of adverse weather conditions, etc.). That is, the 
"plain text" he offers as the "solution" involves his making helter- 
skelter many "corrections and Emendations", which, one maybe sure, will 
be based on what his subconscious mind expects or desires to find in the 
cleartext message. Unfortunately, another would-be cryptanalyst working 
upon the same cryptogram and hypothesis independently might conceivably 
"degarble" the cryptogram in different spots and produce an entirely 
dissimilar "plain jbext" as his "solution". Both "solutions" would be 
invalid because they are based upon an erroneous hypothesis--the crypto¬ 
gram actually happens to be a polyalphabetic substitution cipher which 
when correctly analyzed requires on the part of unbiased observers no 
assumption of garbles to a degree that strains their credulity. The 
last phrase Is added here because in professional cryptanalytic work it 
is very often necessary to make a few corrections for errors but it is 
rarely the case that the garble rate exceeds more than a few percent of 
the characters of th^ c^iptogram, say 5 to 10$ at the outside. It is 
to he noted, however, that occasionally the solution to a cryptogram 
may involve the correction of more than this percentage of errors, but 
the solution would he regarded as valid only if the errors can he shown 
to be systematic in some significant respect, or can otherwise he 
explained by objective rationalization. 
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be used to Impugn the integrity of the pse-ido-crypti na3.yst. The worst 
that can he said of him ic that La has 'become a victim of a special or 
peculiar form of self-delusion, and that his desire to solve the problem, 
usually in accord with some previously-formed opinion, or notion, has 
over-balanced, or undermined, his judgment and good sense.13 


■*■3 Specific reference can be made to the following typical "case 
histories": 

Donnelly, Ignatius, The Great Cryptogram . Chicago, 1888. 

Owen, Orville W., Sir Francis Bacon’s Cipher Story . Detroit, 1895• 
Gallup, Elizabeth Wells, Francis Bacon's Biliteral Cipher , 

Detroit, 1900, 

Arensbcrg, 'Walter Conrad, The Cryptography of Shakespeare . Los 
Angeles, 1922. 

The Shakespearean Mystery . Pittsburgh, 1928. 

The Baconian Keys . Pittsburgh, 1928. 

Margoliouth, D. S., The Homer of Aristotle . Oxford, 1923* 
^Newbold, William Romaine, The Cipher of Roger Bacon . Philadelphia, 
! / 1928. : (For a scholarly and complete demolition of Professor 

^ \ Rewbdld' , s work, see an article entitled Roger Bacon and the 


1 ? 


Voynich MS, by John M 
July I931.) 


Manly, in Speculum , Vol. VI, Ho. 3, 

The Shakespearean Cypher . Rochester, H. Y., 


/ Feely, JoBeph Martin, 
i 1931. 

1 Deciphering Shakespeare . Rochester, H. Y., 193k. 

Roger Bacon’s Cypher: the right key found . Rochester, H. Y., 19^3- 
Wolff, Werner, Dechiffrement de l*Ecriture Maya . Paris., *1938. 

/Strong, Leonell C., Anthony Askham, the author of the Voynich 
manuscript , in Science , Vol. 101, June 19, 19^-5, PP. 608-9. 
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SECTION II 


BASIC CRttPTOLOGIC CORSIDERATIONS 


Cryptology* ccmnmraicatlon Intelligence, and communication 

security. 

Secret communication.. r • *».»...*<;•» * 

Plain text and encrypted teat........••*•«*•.«••..» 

Cryptography, encrypting, ard decrypting.*.. 

Codes, ciphers, and enciphered code .»......... 

General system, specific hey, and cryptosystem* 

Cryptanalytics and cryptanalysis,.....*.«.*••»*»«.*... 

Transposition and substitution..... 

Nature of alphabets.*.... 

Types of alphabets....... 


Paragraph 
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11 
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h. Crg?tology t ^communication intelligence, and communicatio n 
security . The occasional' or frequent need for secrecy in the conduct of 
inportant affairs has been a*ecogniZed from time immemorial. In the case 
of diplomacy and organized irarfare this need is especially important in 
regard to communications. However, [when such communications are trans¬ 
mitted by electrical means, they can be heard and copied by unauthorized 
persons. The protection resulting from all measures designed to deny 
to unauthorized persons information of value ■which may be derived from 
such communications is called communication security . The evaluated 
information concerning the enemy, derived principally from a study of his 
electrical communications, :Ls called communication intelligence . The coin 
lective term including all phases of communication intelligence and com¬ 
munication security is cryptology . 1 Or, stated in broad terms, cryptology 
is that branch of Imovledge which treats of hidden, disguised, or secret* 
communications* 


—■ . ■ ■ ■ ' - 

J - Prom the Greek teryptoa (hidden) + logos (learning). The prefix 
"crypto-" in compound words pertains to' H cryptologic n , "cryptographic", 
or "cryptanalytic", depending upon the use of the particular .word as 
defined. 

2 In this text the term '‘secret" will be used in its ordinary sense 
as given in the dictionary. Whenever the designation is used in the more 
restricted sense of the security classification aa defined in official 
regulations, it will be capitalized. There are in current use the four 
classifications Restricted, Confidential, Secret, and Top Secret, listed _ 
In ascending order of degree* 
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g. Secret communication .--a. Ccmrarunlcatlon may be conducted by 
any means susceptible of ultimate Interpretation by one of tlxe five 
Benses, but those most commonly used are sight and hearing. Aside from 
the use of simple visual and auditory signals for communication over 
relatively short distances, the visual method of communication between 
or among individuals separated from one another by relatively long dis¬ 
tances involves, at one stage or another, the act of writing or of 
speaking over a telephone. 

b. Privacy or secrecy in communication by telephone can be obtained 
by using equipment which affects the electrical currents involved in te¬ 
lephony so that the conversations can be understood only by persons pro¬ 
vided with suitable equipment properly arranged for the purpose. The 
same thing is true in the case of electrical transmission of pictures, 
drawings, maps, and television Images* However, this text will not treat 
of these aspectg3 of cryptology. 

c. Writing may be either visible or invisible . In the former, the 
characters are inscribed with ordinary writing materials and can be seen 
with the naked eye; in the latter, the characters are inscribed by means 
or methods which make the writing invisible to the naked eye. Invisible 
writing can be prepared with certain chemicals called invisible, sympa¬ 
thetic, or secret inks, and in order to "develop" such writing, that is, 
make it visible, special processes must usually be applied. There are 
also methods of producing writing which is invisible to the naked eye 
because the characters are of microscopic size, thus requiring special 
photographic or microscopic apparatus to make such writing visible to the 
naked eye. 

d. Invisible writing and unintelligible visible writing constitute 
secreE writing . 

6. Plain text and encrypted text .—a. Visible writing which is 
intelligible, that is, conveys a more or less understandable or sensible 
meaning (in the language in which written) and which Is not intended to 
convey a hidden meaning, is said to he in plain texfc .^ A message in 
plain text is termed a plaintext; message, a cleartext message, or & 
message in clear . 


3 These aspects of cryptology are now known as clphony (from cipher + 
tele phony ): cifax (from cipher + facsi mile): and civlslon (from, cipher + 
tele vision ). 

^ Visible writing may be intelligible but the meaning it obviously 
conveys may not be its real meaning, that is, the meaning intended to be 
conveyed. To quote a simple example of an apparently innocent message 
containing a secret or hidden meaning, prepared with the Intention of es¬ 
caping censorship, the sentence "Son born today" may mean "Three trans¬ 
ports left today.” Messages of this type are said to be in open code . - 
Secret communication methods or artifices of this sort (concealment sys¬ 
tems) are impractical for field military use but are often encountered 
in espionage and counter-espionage activities* 
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b. Visible -writing which conveys no intelligible meaning in any 
recognized language5 is said to be in encrypted text and such writing 18 
termed a cryptogram.® _ 

J t l I l _ 9 

Jo Cryptography, enc r ypting, and decrypting .--a. Cryptography is 
that branch of cryptolo^’w;iich treats of "various means , methods, and 
apparatus for converting or transforming plaintext messages into crypto¬ 
grams and for reconverting the cryptograms into their original plaintext 
forms by a simple reversal of the steps used in their transformation. 

b. To encrypt is to convert or transform a plaintext message into 

a cryptogram by following certain rules, steps, or processes constituting 
the key or keys and agreed upon in advance by correspondents, or furnished 
them' by higher 'authority. 

c. aeerypt is> to reconvert or to transform a cryptogram into the 
original equivalent plaintext message by a direct reversal of the en¬ 
crypting process, that Is, by applying to the cryptogram the key or keys 
{usually in a reverse order) used in producing the cryptogram. 

d. A person skilled in the art of encrypting and decrypting, or one 
who has a part in devising a cryptographic system is called a crypto¬ 
grapher; a clerk who encrypts and decrypts, or who assists in such work, 
is called a cryptographic clerk . 

8. Codes, ciphers, arid enciphered code .—a. Encrypting and de¬ 
crypting are accomplished by means collectively designated as codes and 
ciphers . Such means are used for either or both of two purposes* (l) se¬ 
crecy, and (2) economy or brevity. Secrecy usually is far more important 
in military cryptography ttian economy or brevity. In ciphers or cipher 
systems, cryptograms are paroduced by applying the cryptographic treatment 
to individual letters of the plaintext messages, whereas, in codes or code 
systems , cryptograms are produced by applying the cryptographic treatment 
to entire words, phrases, and sentences of the plaintext messages. The 
specialized meanings of the terms ToSe and cipher are explained In detail. _ 
later. 

b. A cryptogram produced by means of a cipher system is said to be 
in cipher and is called a cipher message, or sometimes simply a cipher . 

Vhe act or operation of encrypting" a cipher message is called enciphering. 


^ There is a certain type of writing which is considered by its authors 
t6 be intelligible, but which is either completely unintelligible to the 
wide variety of readers or else requires considerable mental struggle on 
their part to make it intelligible. Reference is here made to so-called 
"modern literature" and "modern verse", products of such writers as 
E. E. Cummings, Gertrude Stein, James Joyce, et al. 

' ^ From kryptos + gramma '(that which is written). Analogous terminol¬ 
ogy would call a plaintext message a phanerogram (phaneros = visible, 
manifest, open). 
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and the enciphered version of the plain text, as well as the act or pro¬ 
cess Itself, is often referred to as the encipherment * The cryptographic 
clerk who performs the process serves as an encipherer . The corresponding 
terms applicable to the decrypting of cipher messages are deciphering, 
decipherment , and decipherer . A clerk who serves as both an encipherer 
and decipherer of messages is called a cipher clerk . 

c. A cipher device is a relatively simple mechanical contrivance for 
eneipEerment and decipherment, usually "hand-operated" or manipulated by 
the fingers, as for example a device with concentric rings of alphabets, 
manually powered; a cipher machine is a relatively complex apparatus or 
mechanism for encipherment and decipherment, usually equipped with a 
typewriter key board and often requiring an external power source. 

d. A cryptogram produced by means of a code system is said to be in 
code and is called a code message . The text of the cryptogram is referred 
to' as code text . This act or operation of encrypting, is called encoding, 
and the encoded version of the plain text, as well as the act or process 
itself, is referred to as the encodement . The clerk who performs the pro¬ 
cess serves as an encoder . The corresponding terms applicable to the 
decrypting of code messages are decoding , decodement, and decoder . A 
cryptographic clerk who serves as both an encoder and decoder of messages 
is called a code clerk . 

e. Sometimes, for special purposes (usually increased security), 
the code text of a cryptogram undergoes a further step in concealment 
involving superencryption , that is, encipherment of the characters com¬ 
prising the code text, thus producing what is called an enciphered-coda 
message, or enciphered code . Encoded cipher, that is, the case where the 
final cryptogram is produced by enciphering the plain text and then en¬ 
coding the cipher text obtained from the first operation, is also possible, 
but rare. 

9. General system, specific key, and cryptosystem .—a. There are 
a great many different meihods of encrypting messages, so that correspon¬ 
dents must first of all be in complete agreement as to which of them will 
be used in their secret communications, or in different types or classes 
of such communications. Furthermore, it is to be understood that all the 
detailed rules, processes, or steps comprising the cryptography agreed 
upon will be invariant, that is, constant or unvarying in their use in a 
given set of communications. The totality of these basic, invariable 
rules, processes, or steps to be followed in encrypting a message according 
to the agreed method constitutes the general cryptographic system or, more 
briefly, the general system . 

b. It is usually the case that the general system operates in con¬ 
nection with or under the control of a number, a group of letters, a word, 
a phrase, or sentence which is used as a key, that is, the element which 
specifically governs the manner in which the general system will be applied 
in a specific message, or the exact setting of a cipher device or a cipher 
machine at the initial point of encipherment or decipherment of a specific 
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message. This element—usually of a variable nature or changeable at the 
will of the correspondents, or prearranged for them by higher authority— 
is called the specific key . The specific key may also involve the use of 
a set of specially prepared tables, a special document, or even a book* 

c* The term cryptosystem! is used when it is desired to designate 
or refer to all the cryptomaterial (device, machine, instructions for use, 
key lists, etc.) as a unit to provide a single, complete system and means 
for secret communication. 

10. Cryptanalytics and cryptanalysis .—a. In theory any cryptosystem 

(except one°) can be "broken”, l.e., BOlved, if enough time, labor, and 
skill are devoted to it, and if the volume of traffic in that system is 
large enough. This can be done even if the general system and the speci¬ 
fic key are unknown at the start. In military operations theoretical 
rules must usually give way to practical considerations. How the theo¬ 
retical rule in this case is affected by practical considerations will 
be discussed in Appendix 11. 

b. That branch of cryptology which deals with the principles, 
methods, and means employed in the solution or analysis of cryptosystems 
is called cryptanalytic s . ..... ._ 

c* The steps and operations performed in applying the principles of 
cryptanalytics constitute c ryptanalysis . To eryptanalyze a cryptogram 
is to solve it by cryptanalysis. 

d. A person skilled in the art of cryptanalysis is called a crypt¬ 
analyst, and a clerk who assists in such work is called a cryptanalyblc 
clerk . .. 

11. Transposition and substitution .—a. Technically there are only 
two distinct type's of treatment wfaicli! may Fe applied to written plain 
text to convert it into secret text, yielding two different classes of 
cryptograms. In the first, called transposition, the elements or units 
of the'plain text retain their original identities and merely undergo 
some change in their relative positions, with the result that the original 
text becomes unintelligible. In the second, called substitution, the 
elements of the plain text retain their original relative positions but 
are replaced by other elements with different values or meanings, with 
the result that the original text becomes unintelligible. Thus, in the 
case of transposition ciphers, the unintelligibility is brought about 
merely by a change in the original sequence of the elements or units of 


7 The term cryptosystem is used in preference to cryptographic system 
so as to permit it's use in designating secret communication systems 
involving means other than writing, such as ciphony and cifax. 

® The exception is the Tone-time” system in which the key is used only 
once and in itself must have no systematic construction, derivation, or 
meaning. 
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the plain text; in the case of substitution ciphers, the unintelllgi- 
bility is brought about by a change in the elements or units themselves, 
■without a change in their relative order* 

b. It is possible to encrypt a message by a substitution method and 
then to apply a transposition method to the substitution text, or vice 
versa. Such combined transposition^substitution methods do not form a 
third class of methods . They are occasionally encountered in military 
cryptography, but the types of combinations that are sufficiently simple 
to be practicable for field use are very limited .9 

c. Under each of the two principal classes of cryptograms as out- 
lined"above, a further classification can be made based upon the number- 
of characters composing the textual elements or units undergoing crypto¬ 
graphic treatment. These textual units are composed of (l) individual 
letters, (2) combinations of letters in regular groupings, (3) combina¬ 
tions of letters in irregular, more or less euphonious groupings called 
syllables, and (4) complete words, phrases, and sentences. Methods which 
deal with the first type of units are called monographic methods} those 
which deal with the second type are called polygraphic (digraphic, tri¬ 
graphic, etc.); those which deal with the third type, or syllables, are 
called syllabic; and, finally, those which deal with the fourth type are 
called lexical (of or pertaining to words). 

d. It is necessary to indicate that the foregoing classification of 
cryptographic methods is more or less artificial in nature, and is estab¬ 
lished for purpose of convenience only. No sharp line of demarcation can 
be drawn in every case, for occasionally a given system may combine me¬ 
thods of treating single letters, regular or irregular-length groupings 
of letters, syllables, words, phrases, and complete sentences. When in 

a single system the cryptographic treatment is applied to textual units 
of regular length, usually monographic or digraphic (and seldom longer, 
or intermixed monographic and digraphic), the system is called a cipher 
system . Likewise, when in a single system the cryptographic treatment is 
applied to textual units of irregular length, usually syllables, whole 
words, phrases, and sentences, and Is only exceptionally applied to single 
letters or regular groupings of letters, the system is called a code 
system and generally involves the use of a code book . 1 *-* 

12. Nature of alphabets .—a. One of the simplest kinds of substitu¬ 
tion ciphers is that which is known in cryptologic literature as Julius 
Caesar's Cipher, but Which, as a matter of fact, was a favorite long 
before his day. In this cipher each letter of the text of a message is 
replaced by the letter standing the third to the right of it in the 


9 One notable exception is the ADFGVX system, used extensively by the 
Germans in World War I. See in this connection the Cryptographic 
Supplement (Appendix 7)* * 

10 A list of single letters, frequent digraphs, trigraphs, syllables, 
and words is often called a syllabary ; cryptographic treatment of the 
units of such syllabaries places them in the category of code systems. 
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ordinary alphabet; the letter A is replaced by D, the letter B by E, 
and so on. The word cab becomes converted into SDE, "which is cipher. 

b. The English langusige is written, by means' of 26 simple characters 
calleT letters which, taken together and considered as a sequence of 
symbols, constitute the alphabet of the language. Not all systems of 
writing are of this nature, Chinese writing is composed of about Mi-,000 
complex characters, each representing one sense of a word. Whereas 
English words are composite or polysyllabic and may consist of one to 
eight or.more syllables, Chinese words are all monosyllables and each 
monosyllable is e word. Written languages of the majority of other 
civilized peoples of today are, however, alphabetic and polysyllabic in 
construction, so that the principles discussed here apply to all of them. 

£. The letters comprising the English alphabet used today are the 
results of a long period of evolution, the couplete history of which may 
never fully he known. 11 They are conventional symbols representing 
elementary sounds, and any other simple symbols, so long as the sounds 
which they represent are agreed upon by those concerned, will serve the 
purpose equally well. If taught from early childhood that the symbols 
*, and @ represent the sounds "Ay", "Bee", and "See" respectively, the 
combination @$* would still be pronounced cab, and would, of course, have 
exactly the same meaning as before. Again, let us suppose that two per¬ 
sons have agreed to change the sound values of the letters F, G, and H, 
and after long practice have become accustomed to pronouncing them as we 
pronounce the letters A, B, and C, respectively; they would then write 
the "word" HFG, pronounce it cab , and see nothing strange whatever in the 
matter. But to others no party to their arrangements, BFG constitutes 
cipher. The combination of sounds called for by this combination of 
symbols is perfectly intelligible to the two who have adopted the new 
sound values for those synibols and therefore pronounce HFG as cab ; but 
HFG is utterly unpronounceable and wholly unintelligible to others who 
are reading it according to their own long-established system of sound 
and symbol equivalents. It would he stated that there is no such word 
as HFG, which would mean merely that the particular combination of sounds 
represented by this combination of letters has not been adopted by con¬ 
vention to represent a thing or an idea in the English language. Thus, 
it is seen that, in order for the written words of a language to be 
pronounceable and intelligible to all who speak that language, it is 
necessary, first, that the sound values of the letters or symbols be 
universally understood and agreed upon and, secondly, that the particular 
combination of sounds denoted by the letters should have been adopted to 
represent a thing or an idea. Spoken plain language consists of vocables; 
that is, combinations and permutations of elementary speech-sounds' whicE 
have by long usage come to he adopted and recognized as representing 
definite thingB and ideas. Written plain language consists of words; 
that is, combinations and permutations of simple symbols, calledTetters, 
which represent visually and call forth vocally the elementary speech- 
sounds of which the spoken language is composed. 


11 An excellent and most authoritative hook on this subject is The Al¬ 
phabet; a key to the history of Mankind by David Diringer. London,' 1^49'. 
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ordinary alphabet* the letter A is replaced by D, the letter B by E, 
and so on. The word cjib becomes converted into FDE, which is cipher. 

b. The English ljinguage is written by means of 2.6 simple characters 
c al ].el letters which, taken together and considered as a sequence of 
symbols, constitute the alphabet of the language. Not all systems' 'of 
writing are of this nature. Chinese writing is composed of about Mj-,000 
complex characters, each representing one sense of a word. Whereas , 
English words are composite or polysyllabic and may consist of one to 
eight or more syllables, Chinese words are all monosyllables and each 
monosyllable is s word. Written languages of the majority of other 
civilized peoples of today are, however, alphabetic and polysyllabic in 
construction, so that the principles discussed here apply to all of them. 

£. The letters comprising the English alphabet used today are the 
results of a long period of evolution, the complete history of which may 
never fully be known.^ They are conventional symbols representing 
elementary sounds, and any other simple symbols, so long as the sounds 
which they represent are agreed upon by those concerned, will serve the 
purpose equally well. If taught from early childhood that the symbols 
$, *, and @ represent bhe sounds "Ay", "Bee", and "See" respectively, the 
combination @$* would still be pronounced cab, and would, of course, have 
exactly the same meaning as before. Again, let us suppose that two per¬ 
sons have agreed to change the sound values of the letters F, G, and H, 
and after long practice have become accustomed to pronouncing them as we 
pronounce the letters A, B, and C, respectively; they would then write 
the "word" HFG, pronounce it cab, and see nothing strange whatever in the 
matter. But to others no party to their arrangements, HFG constitutes 
cipher. The combination of sounds called for by this combination of 
symbols is perfectly intelligible to the two who have adopted the new 
sound values for those symbols and therefore pronounce HFG as cab ; but 
HFG is utterly unpronounceable and wholly unintelligible to others who 
are reading it according to their own long-established system of sound 
and symbol equivalents. It would be stated that there is no such word 
as HFG, which would mean merely that the particular combination of sounds 
represented by this combination of letters has not been adopted by con¬ 
vention to represent a thing or an idea in the English language. Thus, 
it is seen that, in order for the written words of a language to be 
pronounceable and intelligible to all who speak that language, it is 
necessary, first, that the sound values of the letters or symbols be 
universally understood and agreed upon and, secondly, that the particular 
combination of sounds denoted by the letters should have been adopted to 
represent a thing or an idea. Spoken plain language consists of vocables; 
that is, combinations and permutations of elementary speech-sounds which, 
have by long usage come to be adopted and recognized as representing 
definite thingB and ideas. Written plain language consists of words; 
that is, combinations and permutations of simple symbols, called letters, 
which represent visually and call forth vocally the elementary speech- 
sounds of which the spoken language is composed. 


11 An excellent and most authoritative book on this subject is The Al- 
phabet; a key to the history of Mankind by David Diringer. London,' 19^9* 
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ordinary alphabet; the letter A is replaced by D, the letter B by E, 
and so on. The word cab becomes converted into IDE, which is cipher. 


b. The English language is written by means of 2 6 simple characters 
called letters which, taken together and considered as a sequence of 
symbols , constitute the alphabet of the language. Not all systems of 
writing are of this nature. Chinese writing is composed of about M*- ,000 
conplex characters, each representing one sense of a word. Whereas . 
English words are composite or polysyllabic and may consist of one to 
eight or more syllables, Chinese words are all monosyllables and each 
monosyllable is e word. Written languages of the majority of other 
.civilized peoples of today are, however, alphabetic and polysyllabic in 
construction, so that the principles discussed here apply to all of them. 


i 


c_. The letters comprising the English alphabet used today are the 
results of a long period of evolution, the complete history of which may 
never fully be known.^ They are conventional symbols representing 
elementary sounds, and any other simple symbols, so long as the sounds 
which they represent are agreed upon by those concerned, will serve the 
purpose equally well, if taught from early childhood that the symbols 
$, *, and @ represent the sounds "Ay", ’Dee", and "See" respectively, - the 
combination @$* would still be pronounced cab , and would, of course, have 
exactly the same meaning as before. Again, let us suppose that two per¬ 
sons have agreed to change the sound values of the letters F, G, and H, 
and after long practice have become accustomed to pronouncing them as we 
pronounce the letters A, B, and C, respectively; they would then write 
the "word" HFG, pronounce it cab, and see nothing strange whatever in the 
matter. But "fo others no party to their arrangements, HFG constitutes 
cipher. The combination of sounds called for by this combination of 
symbols is perfectly intelligible to the two who have adopted the new 
sound values for those symbols and therefore pronounce HFG as cab; but 
HFG is utterly unpronounceable and wholly unintelligible to others who 
are reading it according to their own long-established system of sound 
and symbol equivalents. It would be stated that there is no such word 
as HFG, which would mean merely that the particular combination of sounds 
represented by this combination of letters has not been adopted by con¬ 
vention to represent a thing or an idea in the English language. Thus, 
it is seen that, in order for the written words of a language to be 
pronounceable and intelligible to all who speak that language, it is 
necessary, first, that the sound values of the letters or symbols be 
universally understood and agreed upon and, secondly, that the particular 
combination of sounds denoted by the letters should have been adopted to 
represent a thing or an idea. Spoken plain language consists of vocables 
that is, combinations and permutations of elementary speech-sounds which, 
have by long usage come to be adopted and recognized as representing 
definite things and ideas, Written plain language consists of words; 
that is, combinations and permutations of simple symbols, called letters, 
which represent visually and call forth vocally the elementary speech- 
sounds of which the spoken language is composed. 


11 


An excellent and most authoritative book on this subject is The Al¬ 


phabet; a key to the history of Mankind by David Diringer. London,' 1^9* 
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d. It is clear also that in order to -write a polysyllabic language 
with facility it is necessary to establish and to maintain by common 
agreement or convention, equivalency between two sets of elements, first, 
a Bet of elementary sounds and, second, a set of elementary symbols to 
represent the sounds. When this is done the result is what is called an 
alphabet, a word derived from the names of the first two letters of the 
Greek alphabet, "alpha" and "beta". 

e. Theoretically, in an ideal alphabet each symbol or letter would 
denote only one elementary sound, and each elementary sound would inva¬ 
riably be represented by the same symbol. But such an alphabet would be 
far too difficult for the average person to use. It has been conserva¬ 
tively estimated that a minimum of 100 characters would he necessary for 
English alone. Attempts toward producing and introducing into usage a 
practical, scientific alphabet have been made, one being that of the 
Simplified Spelling Board in 1928, which advocated a revised alphabet of 
k2 characters* Were such an alphabet adopted into current usage, in 
books, letters, telegrams, etc., the flexibility of cryptographic systems 
would he considerably extended and the difficulties set in the path of 
the enemy cryptanalysts greatly increased. The chances for its adoption 
in the near future are, however, quite small. Because of the continually 
changing nature of every living language, it is doubtful whether an 
initially "perfect alphabet" could, over any long period of time, remain 
so and serve to indicate with great precision the exact sounds which it 
was originally designed to represent. 

13. Types of alphabets .—a. In the study of cryptography the dual 
nature of the alphabet J becbmes*"apparent. It consists of two parts or 
components, (l) an arbitrarily-arranged sequence of sounds, and (2) an 
arbitrarily-arranged sequence of symbols. 

b. The normal alphabet for any language is one in which these two 
components are the ’ordinary”sequences that have been definitely fixed by 
long usage or convention. The dual nature of our normal or everyday 
alphabet is often lost sight of. When we write A, B, C,... we really 
mean: 

Sequence of sounds; "Ay" "Bee" "See" .... 

Sequence of symbols: A B C .... 

Normal alphabets of different languages vary considerably in the number 
of characters composing them and the arrangement or sequence of the 
characters. The English, Dutch, and German alphabets each have 2 6 ; the 
French. 25; the Italian, 21; the Spanish, 27 (including the digraphs CH 
and Li); and the Russian, 31*^ The Japanese language has a syllabary 
consisting of 72 syllabic sounds which require 48 characters for their 
representation. 


^ In contrast to the foregoing alphabets, it is of interest to note 
that in the Hawaiian language the alphabet consists of only 12 letters, 
viz , the five vowels A, E, I, 0, U, and the seven consonants H, K, L, 

M, N, P, W. 
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c. A cipher alphabet, or substitution alphabet as it is sometimes 
called, is one in which"the elementary speech-sounds are represented by 
characters other than those representing them in the normal alphabet* 
These characters may be letters,, figures, signs, symbols* or combinations 
of them. 

d. When the plain text of a message is converted into encrypted 
text ’Ey the use of one or more cipher alphabets, the resultant cryptogram 
constitutes a substitution cipher . If only one cipher alphabet is in¬ 
volved, it is called a m onoalphabetic substitution cipher; if two or more 
cipher alphabets are involved, it is called a polyalphabetlc substitution 
cipher . 

e. It is convenient to designate that component of a cipher alpha¬ 
bet constituting the sequence of speech-sounds as the plain component 
and the component constituting the sequence of symbols as the cipher 
component . If omitted In a cipher alphabet, the plain component is 
understood to be the normal sequence. For brevity and clarity, a letter 
of the plain text, or of the plain component of a cipher alphabet, is 
designated by suffixing a small letter ”p w to it: Ap means A of the 
plain text, or of the pltdn component of a cipher alphabet. Similarly, 
a letter of the cipher text, or of the cipher component of a cipher 
alphabet, will be designated by suffixing a small letter "c* to it: 

Xq means X of the cipher text, or of the cipher component of a cipher 
alphabet. The expression Ap = Xq means that A of the plain text, or A 
of the plain component of a cipher alphabet, is represented by X in the 
cipher text, or by X in the cipher component of a cipher alphabet. 

f. With reference i;o the arrangement or sequence of letters forming 
their”components, cipher alphabets are of two types: 

(1) Standard cipher alphabets, in which the sequence of letters in 
the plain component is tie normal^ and in the cipher component is the 
same as the normal, but reversed in direction or shifted from its normal 
point of coincidence with the plain component. 

(2) Mixed cipher alphabets, in which the sequence of letters or 
characters in one or both, of the components is no longer the same as 
the normal in its entirety. 

g. Although the basic considerations of the preceding paragraphs 
place~the student in a position to undertake the study of certain funda¬ 
mental principles of cryptanalysis, this may be a good point at which to 
pause and to make a few remarks with regard to the role that cryptanaly¬ 
sis plays in the whole chain of more or less complex operations involved 
in deriving communication intelligence, after which these fundamental 
cryptanalytlc principles will be treated. 
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l4. The role of cryptanalysis in communication intelligence 
operations .—a. Through the medium of communication intelligence an at¬ 
tempt is made"to answer three questions concerning enemy communications: 
’’Who?". "Where?" "What?"— Who are theii 1 originators and addressees? 

Where are these originators and addressees located? What do the messages 
say? 

b. All of the foregoing questions are very important in the military 
application of communication intelligence. Hence, even though this text 
deals almost exclusively with the principles and operations involved in 
deriving the answer to the third question—"What do the messages say?" — 
a few words on the importance of the first and second questions may be 
useful. It is a serious mistake to think that one can necessarily and 
always correctly interpret the mere text of a message without identifying 
and locating the originator and the addressee or, on many occasions, with¬ 
out having a background against which to interpret the message in order 
to appreciate its real import or significance. 

£. The very first step in the series of activities involved In de¬ 
riving communication intelligence is the collection of the raw material, 
that is, the interception 3 - and copying of the transmissions constituting 
the messages to be studied and analyzed. 


d. Then, with the raw material in hand, studies are made in order 
to answer the first two questions—"Who?" and "Where?" The answers to 
these questions are not always obvious in modern military communications, 
especially in the case of messages exchanged by units in the combat zone, 
since messages of this sort rarely indicate in plain language who the 


1 T° intercept means, in its cryptologic sense, to gain possession of 
communications which are intended for other recipients, without obtaining 
the consent of these addressees and without preventing or ordinarily 
delaying the transmission of the communications to them. 
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originator and the addressee are or where they are located. Consequently, 
certain apparatus and techniques specifically developed for finding the 
answers to these questions must he employed. These apparatus and tech¬ 
niques are embraced by that part of communication intelligence theory and 
practice which is known as traffic analysis . This latter subject and 
interception are treated briefly in Appendix 10, ’’Communication intelli¬ 
gence operations". (The serious student will derive much practical 
benefit from a careful reading of this appendix.) 

e. The foregoing operations, interception ahd traffic analysis, 
along"*with cryptanalysis constitute the first three operations of commu¬ 
nication intelligence. But generally there must follow at least one addi¬ 
tional operation. If the plain texts recovered through cryptanalysis are 
in a foreign language, they must usually be translated, and translation 
constitutes this fourth operation. In the course of translating, it may 
be found that, because of errors in transmission or reception, corrections 
and emendations must be made in these plain texts; however, although this 
often requires skill and experience of a high order, it does not consti¬ 
tute another communication intelligence operation, since it is but an 
auxiliary step to the process of translation. 

f. In a large-scale communication intelligence effort these four 
steps, interception, traffic analysis, cryptanalysis, and translation, 
must be properly organized and coordinated in order to gain the most bene¬ 
fit from the potentialities of communication intelligence,“that is, the 
production of the maximum quantity of information from the raw traffic. 
This information must then be evaluated by properly trained intelligence 
specialists, collated with Intelligence derived from other sources, and, 
finally, disseminated to the commanders who need the intelligence in time 
to be of operational use to them, rather than of mere historical interest. 
The foregoing operations and especially the first three—interception, 
traffic analysis, and cryptanalysis—usually complement one another. 

This, however, is not the place for elaboration on the interrelationships 
which exist and which when properly integrated make the operations as a 
whole an efficient, unified complex geared to the fulfillment of its prin¬ 
cipal goal, namely, the production of timely communication intelligence. 

g. With the foregoing general background, the student is prepared 

to proceed to the technical considerations and principle's of cryptanalysis. 

15. The four basic operations in cryptanalysis .—a. The solution of 
practically every cryptogram involves four fundamental operations or steps: 

(1) The determination of the language employed In the plaintext 
version. 

(2) The determination of the general system of cryptography employed. 

(3) The reconstruction of the specific key in the case of a cipher 
system, or the reconstruction, partial or complete, of the code book, in 
the case of a code system; or both, in the case of an enciphered code 
system. 

(4) The reconstruction or establishment of the plain text. 
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b. These operatiors will "be taken up in the order in which they are 
given above and in which they usually are performed in the solution of 
cryptograms, although occasionally the second step my precede the first 

- 16. The determination of the language employed .—a. There is not 
much that need "be said with respect to this operation except that the 
determination of the language employed seldom comes into question in the 
case of studies made of the cryptograms of an organized enemy. By this 
is meant that during wartime the enemy is of course known, and it follows, 
therefore, that the language he employs in his messages will almost cer¬ 
tainly he his native or mother tongue. Only occasionally nowadays is 
this rule broken. Formerly it often happened, or it might have indeed 
been the general rule, that the language used in diplomatic correspondence 
was not the mother tongue, hut French. In isolated instances during 
World War I the Germans used English when their own language could for 
one reason or another not he employed. For example, for a year or two 
before the entry of the United States into that war, during the time 
America was neutral and the German Government maintained its embassy in 
Washington, some of the messages exchanged between the Foreign Office in 
Berlin and the Embassy in Washington were encrypted in English, and a 
copy of the code used was deposited with the Department of State and our 
censor. Another instance is found in the case of certain Hindu conspira¬ 
tors who were associated with and partially financed by the German Govern¬ 
ment in 1915 and 1916; they employed English as the language of their 
cryptographic messages„ Occasionally the cryptograms of enemy agents may 
be in a language different from that of the enemy. But in general these 


1 Although the foregoing four steps represent the classical or ideal 
approach to cryptanalysis, the art may he reduced to the following: 

Procedures Ln cryptanalysis 

1. Arrangement aad rearrangement of 
data to disclose non-random cha¬ 
racteristics or manifestations 
(i.e., in frequency counts, re¬ 
petitions, patterns, symmetrical 
phenomena, etc.). 

2. Recognition of the non-random 
characteristics or manifestations 

. when disclosed. 

3. Explanation of the non-random 
characteristics when recognized. 

In all of the foregoing, the element of luck plays a very important part, 
as it is possible to side-step a large amount of labor and effort, in 
many cases, if "hunches" or intuition lead the analyst forthwith to the 
right path. Therefore, the phrase "or luck" should he added to each of 
the requirements above. 

In fact, it all boils down to the simple statement: "Find something 
significant, and attach some significance thereto." 


Requirements 

Experience or ingenuity, 
and time (which latter may 
be appreciably lowered by 
the use of machine aids in 
cryptanalysis). 

Experience or statistics. 


Experience or imagination, 
and intelligence. 
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are, as has been said, isolated instances; as a rule, the language used 
in cryptograms exchanged between members of large organizations is the 
mother tongue of the correspondents. Where this is not the case, that is, 
when cryptograms of unknown origin must be studied, the cryptanalyst 
looks for any indications on the cryptograms themselves which may lead to 
a conclusion as to the language employed. Address, signature, and other 
data, if in plain text in the preamble, in the body, or at the end of the 
cryptogram, all come under careful scrutiny, as well as all extraneous' 
circumstances connected with the manner j.r which the cryptograms were 
obtained, the person on whom they were found, or the locale of" their _ 
origin and destination. 

b. In special cases, or under special circumstances a clue to the 
language employed is found in the nature and composition of the crypto¬ 
graphic text itself. For example, if the letters K and W are entirely 
absent or appear very rarely in messages, it may indicate that the lan¬ 
guage is Spanish, for these letters are absent in the alphabet of that 
language and are used only to spell foreign words or names. The presence 
of accented letters or letters marked with special signs of one sort or 
another, peculiar to certain languages, will sometimes indicate the lan¬ 
guage used. The Japanese Morse telegraph alphabet and the Russian Morse 
telegraph alphabet contain combinations of dots and dashes which are 
peculiar to those alphabets and thus the interception of messages con¬ 
taining these special Morse combinations at once indicates the language 
Involved. Finally, there are certain peculiarities of alphabetic lan¬ 
guages which, in certain types of cryptograms, viz ., pure transposition, 
give clues as to the language used. For example, the frequent digraph CH, 
in German, leads to the presence, in cryptograms of the type mentioned, 

of many isolated C's and H's; if this is noted, the cryptogram may be 
assumed to be in German. 

c. In some cases it is perfectly possible to perform certain steps 
in cryptanalysis before the language of the cryptogram has been definitely 
determined. Frequency studies, for example, may be made and analytic 
processes performed without this knowledge, and by a cryptanalyst wholly 
unfamiliar with the language even if it has been identified, or who knows 
only enough about the language to enable him to recognize valid combina¬ 
tions of letters, syllables, or a few common words in that language. He 
may, after this, call to his assistance a translator who may not be a 
cryptanalyst but who can materially aid in making necessary assumptions 
based upon his special knowledge of the characteristics of the language 

in question. Thus, cooperation between cryptanalyst and translator 
results in solution.^ 


— The writer has seen in print statements that "during World War . decoded messages in Japanese 
and Russian without knowing a word of either language." The ext it to which such statements arc exaggerated 
will soon become obvious to the student. Of course, there are occasional instances in which a mere clerk with 
xylite limited experience may be able to “solve" a message m an extremely simple system in a language of which 
he has no knowledge at all; but such a “solution” calls for nothing more arduous than Vhe ability to recognize 
pronounceable combinations of vow-cls and consonants—an ability that hardly deceives to be rated as “cr;pt- 
analytic” in any real sense. To say that it is possible to solve a cryptogram in a foreign language “without 
knowing a word of that language" is not quite the same as to say that it is possible to do so wdth only a slight 
knowledge of the language; and it may be stated without cavil that the better the cryptanalyst's knowledge of 
the language, the greater arc the chances for his success and, in any case, the easier is his work. 
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17» The determination of the general system .—a. Except in the case 
of the more simple types of cryptograms, the step referred to as diagnosis , 
that is, ascertaining the general system according to which a given cryp¬ 
togram has heen produced is usually a difficult, if not the most difficult, 
step in its solution. The reason for this is not hard to find. 

b. As will become apparent to* the student as he proceeds with his 
study, in the final analysis, the solution of every cryptogram involving 
a form of substitution depends upon its reduction to monoalphabetic terms, 
if it is not originally in those terms . This is true not only of ordinary 
substitution ciphers, but also of combined substitution-transposition 
ciphers, and of enciphered code. If the cryptogram must be reduced to 
monoalphabetic terms, the manner of its accomplishment is usually indi¬ 
cated by the cryptogram itself, by external or internal phenomena which 
become apparent to the cryptanalyst as he studies the cryptogram. If 
this is impossible, or too difficult, the cryptanalyst must, by one means 
or another, discover how to accomplish thi3 reduction, hy bringing to 
bear all the special or collateral information he can get from all the 
sources at his command. If both these possibilities fail him, there is 
little left but the long, tedious, and often fruitless process of elimi¬ 
nation. In the case of transposition ciphers of the more complex type, 
the discovery of the basic method is often simply a matter of long and 
tedious elimination of possibilities. For cryptanalysis has unfortunately 
not yet attained, and may indeed never attain, the precision found today 
in qualitative analysis in chemistry, for example, where the analytic 
process is absolutely clear-cut and exact in its dichotomy. A few words 
in explanation of what is meant may not be amiss. When a chemist seeks 
to determine the identity of an unknown substance, he applies certain 
specific reagents to the substance and in a specific sequence. The first 
reagent tells him definitely into which of two primary classes the unknown 
substance falls. He then applies a second test with another specific 
reagent, which tells him again quite definitely into which of two second¬ 
ary classes the unknown substance falls, and so on, until finally he has 
reduced the unknown substance to Its simplest terms and has found out 
what it is. In striking contrast to this situation, cryptanalysis affords 
exceedingly few "reagents” or tests that may be applied to determine posi¬ 
tively that a given cipher belongs to one or the other'of two systems 
yielding externally similar results. And this is what makes the analysis 
of an isolated, complex cryptogram so difficult. Note the limiting adjec¬ 
tive "isolated” in the foregoing sentence, for it is used advisedly. It 
Is not often that the general system falls to disclose itself or cannot 
be discovered by painstaking investigation when there is a great volume 
of text accumulating from a regular traffic between numerous corre¬ 
spondents in a large organization. Sooner or later the system becomes 
known, either because of blunders and carelessness on the part of the 
personnel entrusted with the encrypting of the messages, or because the 
accumulation of text itself makes possible the determination of the 
general system by cryptanalytic, including statistical, studies. But in 
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the case of a single or even a few isolated cryptograms concerning which 
little or no information can he gained hy the cryptanalyst, he is often 
unable, without a knowledge of, or a shrewd guess as to the general system 
employed, to decompose the heterogeneous text of the cryptogram into 
homogeneous, monoalphabetic text, which is .he ultimate and essential 
step in analysis. The only knowledge that the cryptanalyst can bring to 
his aid in this most difficult step is that gained by long experience and 
practice in the analysis of many different types of systems. In this 
respect the practice of cryptanalysis is analogous to the practice of 
medicine: correct diagnosis is the most important and often the most 
difficult first step toward success. 

ic. On account of the complexities surrounding this particular phase 
of cryptanalysis, and because in any scheme of analysis based upon suc¬ 
cessive eliminations of alternatives the cryptanalyst can only progress 
as far as the extent of his own knowledge of all the possible alternatives 
will permit, it is necessary that detailed discussion of the eliminative 
process be postponed until the student has covered most of the field. 

For example, the student will perhaps want to know at once how he can 
distinguish between a cryptogram that is in code or enciphered code from 
one that is in cipher. It is at this stage of his studies impracticable 
to give him any helpful indications on bis question. In return It may be 
asked of him why he should expect to be able to do this in the early 
stages of his studies when often the experienced expert cryptanalyst is 
baffled on the same score! 

d. Nevertheless, in lieu of more precise diagnostic tests not yet 
discovered, a general guide that may be useful in cryptanalysis will be 
built up, step by step as the student progresses, in the form of a series 
of charts comprising what may be designated An Analytical. Key for Crypt¬ 
analysis . (See Section X.) It may be of assistance to the student if, 
as he proceeds, he will carefully study the charts and note the place 
which the particular cipher he is solving occupies in the general crypt- 
analytic panorama. These charts admittedly constitute only very brief 
outlines, and can therefore be of but little direct assistance to him 

in the analysis of the more complex types of cryptosystems he may en¬ 
counter later on. So far as they go, however, they may be found to be 
quite useful in the study of elementary cryptanalysis. For the expe¬ 
rienced cryptanalyst they can serve only as a means of assuring that no 
possible step or process is inadvertently overlooked in attempts to solve 
a difficult cryptosystem. 

e. Much of the labor involved in cryptanalytic work, as referred 
to In’*par. 2, is connected with this determination of the general system. 
The preparation of the text, its rewriting in different forms, sometimes 
being rewritten in dozens of ways, the recording of letters, the estab¬ 
lishment of frequencies of occurrences of letters, comparisons and 
experiments made with known material of similar character, and so on, 
constitute much labor that is most often indispensable, but which 
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sometimes turns out to have teen wholly unnecessary, or in vain. In one 
treatise^ it is stated quite boldly that "this work once done, the deter¬ 
mination. of the system is often relatively easy.” This statement can 
certainly spply only to the simpler types of cryptosystems? it is entirely 
misleading as regards the much more frequently encountered complex 
cryptograms of modern times. 

18. The reconstruction of the specific key .--a.- Nearly all practi¬ 
cal cryptographic methods require the use of a specific key to guide, 
control, or modify the various steps under the general system. Once the 
latter has been disclosed, discovered, or has otherwise come into the 
possession of the cryptanalyst, the next step in solution is to determine, 
if necessary and if possible, the specific key that was employed to en¬ 
crypt the message or messages under examination. This determination may 
not be in complete detail? it may go only so far as to lead to a know¬ 
ledge of the number of alphabets involved in a substitution cipher, or 
the number of columns involved in a transposition cipher, or that a one- 
part code has been used, in the case' of a code system. But it is often 
desirable to determine the specific key in ns complete a form and with 

as much detail as possible, for this information will very frequently be 
useful in the solution of subsequent cryptograms exchanged between the 
same correspondents, since the nature or source of the specific key in a 
solved case may be expected to give clues to the specific key in an 
unsolved case. 

b. Frequently, however, the reconstruction of the key is not a 
prerequisite to, and does not constitute an absolutely necessary preli¬ 
minary step in, the fourth basic operation, via ., the reconstruction or 
establishment of the plain text. In many cases, Indeed, the two processes 
are carried along simultaneously, the one assisting the other, until in 
the final stages both 1 have been completed in their entireties. In still 
other cases the reconstruction of the specific key may follow the recon¬ 
struction of the plain-text instead of preceding it and is accomplished 
purely as a matter of academic interest? or the specific key may, in 
unusual cases, never be reconstructed. 

19. The reconstruction of the plain text .--a. Little need be said 
at this point on this phase of cryptanalysis. The process usually con¬ 
sists, in the case of substitution ciphers, in the establishment of 
equivalency between specific letters of the cipher text and the plain 
text, letter by letter, pair by pair, and so on, depending upon the par¬ 
ticular type of substitution system Involved. In the case of transposi¬ 
tion ciphers, the process consists in rearranging the elements of the 
cipher text, letter by letter, pair by pair, or occasionally word by 
word, depending upon the particular type of transposition system involved, 
until the letters or words have been returned to their original plaintext 
order. In the case of code, the process consists in determining the 
meaning of each code group and inserting this meaning in the code text to 
reestablish the original plain text. 


3 Lange et Soudart, op, cit .. p. 106. 
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b. The foregoing processes do not, as a rule, begin at the begin¬ 
ning of a message and continue letter by letter, or group by group in 
sequence, up to the very end of the message. The establishment of values 
of cipher letters in substitution methods, or of the positions to which 
cipher letters should be transferred to form the plain text in the case 
of transposition methods, comes at very irregular intervals in the pro¬ 
cess * At first only one or two values scattered here and there through¬ 
out the text may appear; these then form the "skeletons" of words, upon 
which further work, by a continuation of the reconstruction process, is 
made possible; in the end the complete or nearly complete 1 ** text is 
established. 

c. In the case of cryptograms in a foreign language, the transla¬ 
tion of the solved messages is a final and necessary step, but is not to 
be considered as a cryptanalytic process. However, it is commonly the 
case that the translation process will be carried on simultaneously with 
the cryptanalytic, and will aid the latter, especially when there are 
lacunae which may be filled in from the context. (See also subpar._ l6e 
in this connection.) 

20. The utilization of traffic Intercepts .5—a. There are, of 
course, other operations which are not as basic in'"nature as those Just 
outlined but which must generally be performed as preliminary steps in 
practical cryptanalytic work (as distinguished from academic cryptana¬ 
lysis)'. 1 " Before a military cryptanalyst can begin the analysis of an 
enemy cryptosystem, it is necessary for him to study the intercept mate¬ 
rial that is available to him, isolate the messages that hve been 
encrypted by means of the cryptosystem to be exploited, and to arrange 
the latter in a systematic order for analysis. This work, although 
apparently very simple, may require a great deal of time and effort. 

b. Since, whenever practicable, two or more intercept stations 
are assigned to copy traffic emanating from the stations of one enemy 
radio net, it is natural that there should be a certain amount of 
duplication in the work of the several stations. This is desirable 
since it provides the cryptanalysts with tiro or more sets of the same 
messages, so that when one intercept station fails to receive all the 
messages completely and correctly, because of radio difficulties, local 
static, or poor operation, it is possible by studying the other sets to 
reconstruct accurately the entire traffic of the enemy net. 

^ Sometimes in the case of code, the meaning of a small percentage 
of the code groups occurring in the traffic may be lacking, because 
there is insufficient text to establish their meaning. 

5 A traffic intercept is a copy of a'communication gained through 
interception. 
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c r In all intercept activities where operators are used for copying 
the traffic, one of the most likely errors to "be found is caused hy the 
human element in reception. For this reason cryptanalysts and their 


Ltrs. 

and 

Figs, 


A 

B 

C 

D 

E 

F 

a 

H 

i 

j 

K 

L 

M 

N 

0 

P 

Q 

R 


Morse 

equi¬ 

valent 


• • wm M 
MM • 

• m # • 

• • 

# m mm 

• m « • 


Frequant Errors 


i, m, t, et 
d, ts 

f, k, r, nn 
^ s, 1, ti 


h, . 

t, i 

r, in 

m, o, z, me 

s, v, b, ii, se 
a, n, s 

v, o, am, eo 
d, o, ta 
r, d, ed 

A M itllli 


* ^ V% J VVI 

a, n, tt 
i, m, t, te 
g, k, v, mt 
J, g, 1, v, 
o, x, z, ma 


an 

o, x, z, ma 
,, f, g, 1, n, s. 


w 


Ltrs. 

and 

Figs. 


S 

T 

U 

V 
W 
X 

Y 
Z 
1 
a 

3 

4 

5 

6 

7 

8 

$ 


Morse 

equi¬ 

valent 


• • — 

MM • « 


Frequent Errors 


h, d, i, r, u 
a, e, n 
a, s, v, it 
h, u, x, st 

а, m, o, r, u, at 
v, k, y, tu 

x, c, nrn 
h, g, q, mi 

0, a 

2, 4 

3, 5 

4, 6 

5, 7 

б, 8 

T, 9 
8 , 0 
9, 1 


Chart 1. Most common errors In telegraphic transmission. 

assistants should be familiar with the international Morse alphabet and 
the most common errors in wire and radio transmission methods so as to be 
able to correct garbled groups when they occur. In this connection, 

Chart 1, above, will be found useful. 



29 
























REF ID:A56892 


REGTRICTE& 


SECTION IV 

FREQUENCY DISTRIBUTIONS AND THEIR FUNDAMENTAL USES 


Paragraph 


The simple or uniliteral frequency distribution....... 21 

Important features of the normal unilateral frequency 

distribution....... 22 

Constancy of the standard or normal uniliteral frequency 

distribution......... 23 

The three facts which can "be determined from a study of the 

uniliteral frequency distribution for a cryptogram........... 2*t 

Determining the class to which a cipher belongs..................... 25 

Determining whether a substitution cipher is monoalphabetic or 

non-monoalphabet ic.... 26 

The (phi) test for determining monoalphabeticity.. 27 

Determining whether a cipher alphabet Is standard (direct 

or reversed) or mixed........ 28 


21. The simple or unillteral frequency distribution .—a. It has 
long been known to cryptographers and typographers that the”letters com¬ 
posing the words of any intelligible written text composed in any language 
which is alphabetic in construction are employed with greatly varying 
frequencies. For example, if on cross-section paper a simple tabulation, 
shown in Fig. 1, called a unillteral frequency distribution , is made of 
the letters composing the words of the preceding sentence, the variation 
in frequency is strikingly demonstrated. It is seen that whereas certain 
letters, such as A, E, I, N, 0, R, and T, are employed very frequently, 
other letters, such as C, G, H, L, P, and S are employed not nearly so 
frequently, while still other letters, such as F, J, K, Q, V, X, and Z 
are employed either seldom or not at all. 


g =s ^ g III 5 lls = 

abcdIfghijklmnopqrstuvwxyz 


14 a 8' 4 22 2 t 10 U 


1 » 3 17 14 8 1 13 10 208 1 fi 1 7 


(Total=200 letters) 


Figure 1. 


b. If a similar tabulation is now made of the letters comprising the 
words”of the second sentence in the preceding subparagraph, the distribu¬ 
tion shewn in Fig. 2 i3 obtained. Both sentences have exactly the same 
number of letters (200). 
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abcdefghijklmnopqrItuvwxyz 


12 2872174 5 20 0 1 86 17 14 6 2 13 14 17 8 1 2 130 

(Total=200 letter^? 

Figure 2. 

c. Although each of these two distributions exhibits great varia¬ 
tion in the relative frequencies with which different letters are employed 
in the respective sentences to' which they apply, no marked differences 
are exhibited between the frequencies of the same letter in the two dis¬ 
tributions, Compare, for example, the frequencies of A, B, C . . . Z in 
Fig, 1 with those of A, B, C . . . Z in Fig. 2. Aside from one or two 
exceptions, as in the case of the letter F, these two distributions agree 

rather strikingly, 

«. 

d. This agreement, or similarity , would be practically complete if 
the two texts were much longer, for example, five times as long. In fact, 
when two texts of similar character, each containing more than 1,000 let¬ 
ters, are compared, it would be found that the respective frequencies of 
the 26 letters composing the two distributions show only very slight dif¬ 
ferences. This means, in other words, that in normal plain text each 
letter of the alphabet occurs with a rather constant or characteristic 
frequency which it tends to approximate, depending upon the length of the 
text analyzed. The longer the text (within certain limits), the closer 
will be the approximation to the characteristic frequencies of letters 

in the language Involved. However, when the amount of text being ana¬ 
lyzed has reached a substantial volume (roughly, 1,000 letters), the prac¬ 
tical gain in accuracy does not warrant further increase in the amount 
of text.-*- 

e. An experiment along these lines will be convincing. A series 
of 260 official telegrams^ passing through the Department of the Army 
Message Center was examined statistically.' The messages were divided 
into five sets, each totaling 10,000 letters, and the five distributions 
shown in Table 1-A, were obtained. 

1 See footnote 5, page 38 . 

2 These comprised meBBages from several official sources in addition 
to the Department of the Army and were all of an administrative character. 
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Table 1-A .—Absolute frequencies of letters appearing in five sets of Governmental plain-text telegrams, 
each set containing 10,000 letters, arranged alphabetically 


Set No. 1 

Set No. 2 

Set Nd, 8 

Set No. 4 

Set No. 5 

Letter 

Absolute 

Frequency 

Letter 

Absolute 

Frequency 

Letter 

Absolute 

Frequency 

Letter 

Absolute 

Frequency 

Letter 

Absolute 

Frequency 

A 

738 

A 


A_. _ 

681 

A 

740 

A._ 

741 

R 

104 

R 


R 

98 

R 

83 

B 

99 

n 

319 

fi 

300 

C. 

288 

C. 

826 

c.. 

301 

n 

387 

n 

413 

D.. 

423 

D 

461 

D 

448 

e 

1,367 

E. 

1,294 

E 

1,292 

E 

1,270 

E 

1,276 

E 

253 

F_. 

287 

F . . 

308 

F 

287 

F 

281 

a 

166 

a 

176 

g 

161 

G 

167 

G 

150 

H 

310 

H 

351 

H 

835 

R... 

349 

H 

349 

I.. 

742 

I 

750 

T 

787 

I_ 

700 

I_ 

697 

j _ 

18 

J. 

17 

.T 

10 

J_ .... 

21 

J. .. 

16 

K 

36 

KL„ _ 

38 

K 

22 

K_ 

21 

K.___ _ 

31 

L..- _ 

365 

L._ 

893 

L 

388 

L _ 

386 

L._ 

844 

M 

242 

M 

240 

M 

238 

M 

249 

M-_ 

268 

N 

786 

N 

794 

N 

815 

N. 

800 

N . _ _ 

780 

n 

685 

— 

0_, 

770 


791 

O 

766 

O _ 

762 

p 

241 

p„_ 

272 

P._ 

817 

P. _ 

245 

P— ^ 

260 

q J 

40 

Q_ _ 

22 

Q i 

46 

Q__ 

88 

Q,_L.| 

80 

R 

760 

R 

745 

R 

762 

R. _ 

786 

R_ 

786 

53 

668 

S 

683 

S_ 

585 

S 

628 

S .._ 1 

604 

T 

936 

T_ 

879 

T 

894 

T. 

968 

T._ 

928 

TT 

270 

II 

233 

U__ 

312 

u' 

247 

U. 

288 

V 

163 

V_ 

173 

V _ ... 

142 

V 

133 

V_ 

155 

W 

166 

ft._ 

163 

w _ 

186 

w_ 

183 

w 

182 

X 

43 

X_ _ 

60 

1 X . .. . 

44 

X. 

63 

X._ 

41 

Y._ 

191 

Y 

166 

Y_ 

179 

Y___ 

213 

Y 

229 

Z 

14 


17 

2L_ 

2 

z_ ..... 

11 

Z_ 

6 











TotaL_ 

10,000 


10,000 


10,000 


10,000 









I 

f, If the five distributions in Table 1-A are summed, the resuits 
are as shown in Table 2-A. \ 


Table 2-A .—Absolute frequencies of letters appearing in the combined five sets of messages totaling 

SO,000 letters, arranged alphabetically 


A.... 

3,683 

G..„. 

819 

L_ 

.. 1 ,821 

0- 

175 

V._ 

766 

B„„ 

487 

H. 

.. 1,694 

M_ 

.. 1 ,237 

FL_ 

3,788 

W._ 

*780 

C__._ 

1,534 

I_ 

.. 3,676 

N_ 

.. 3 ,975 

S_ 

3,058 

X._ 

231 

D_„_ 

2, 122 

J_ 

82 

O_ 

.. 3,764 

T_ 

4, 595 

Y._ 

967 

E__._ 

F.„_ 

6,498 

1,416 

K. 

24 8 

P._ 

.. 1 ,335 

U._ 

1,300 

Z._ 

49 
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g. The frequencies noted in Table 2-A above, when reduced to the 
basis of 1,000 letters and then ’■'sed as a basis for constructing a simple 
chart that will exhibit the variations in frequency in a striking manner, 
yield the following distribution which is hereafter designated as the 
normal or standard uniliteral frequency distribution for English 
telegraphic plain text: 

g 


g 



ABCDEFGHIJKLMNOPQRSTUVWXYZ 


74 10 31 42 130 28 IS 34 74 2 3 80 25 79 75 27 8 70 61 92 20 16 10 S 19 1 



Figure 3. 


22. Important features of the normal uniliteral frequency distri ¬ 
bution .—-a. When the distribution shown in Fig. 3 is studied in detail, 
the following features are apparent: 

(1) It is quite irregular in appearance. This is because the letters 
are used with greatly varying frequencies, as discussed in the preceding 
paragraph. This irregular appearance is often described by saying that 
the distribution shows marked crests and troughs , that is, points of high 
frequency and low frequency. 

(2) The relative positions in which the crests and troughs fall 
within the distribution, that is, the spatial relations of the crests and 
troughs, are rather definitely fixed and are determined by circumstances 
which have been explained in subpar. 13b. 

(3) The relative heights and depths of the crests and troughs within 
the distribution, that is, the linear extens i ons of the lines marking the 
respective frequencies, are also rather, definitely fixed, as would bo 
found if an equal volume of similar text were analyzed. 
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(**•) The most prominent crests are marked "by the vowels A, E, I, 0, 
and hhe consonants IT, R, S, T; the most prominent troughs are marked "by 
the consonants J, K, Q, X, and Z. 

(5) The important data are summarized in tabular form in Table 3* 


Table 3 




Frequency 

Percent of 
total 

Percent of 
total in 
round 
numbers 

6 YowaIs! A R T 0 U Y ,, __ . _ 

398 

39.8 

40 

20 Consonants: 

5 High Frequency (p N R f? T) - -. 

350 

35.0 

35 

10 Medium TFYeqiifflricy (R C F G H I,* M P V W) _ 

238 

23.8 

24 

K Tjow TTrfimiencv (J K 0 "£ 7.} 

14 

1.4 

1 

1 * 



Total _ _________ __ _ 


1,000 

100.0 ! 

1 

100 

- _ 


(6) 

The frequencies 

of the 

letters 

of the alphabet, reduced to 

a 

base of 1000, are 

as follows: 






JL . 

-_ 74 

G_ 

16 

T. 

36 Q.. 

3 

V... . 

15 

B_ 

_ 10 

H 

34 

M 

95 R 

76 

W 

16 

C.„. 

31 

I 

74 

N 

79 S 

61 

A 

5 

D„„ 

_ _ 4-2 

j 

2 

0 

75 T_ 

92 

Y 

19 

E_ 

_ 130 

K 

3 

p 

27 U 

28 

Z.._ __ 

1 

F_ 

_ 28 








(T) 

The relative order of 

frequency of the letters is 

as follows: 

E_ 

_ 130 

I-_ 

74 

C. 

31 Y._ 

19 

X._ 

5 

T_ 

. 92 

S_ 

61 

F_ 

28 G.. 

16 

Q 

3 

N_ 

79 

P ’ 

42 

P 

27 W.. 

16 

K._ 

3 

R_... 

~ 76 

L_ 

36 

U 

26 V-. 

15 

J 

2 

0„„ 

75 

H_ 

34 

M _ 

25 B. 

10 

7 

1 

A.... 

_ 74 









- (8) The four vowels A, E, I, 0 (combined frequency 353) ancl the four 

consonants E, R, S, T (combined frequency 308) form 66l out of every 
1,000 letters of plain text; in other words, less than one-third of the 
alphabet is employed in writing two-thirds of normal plain text. 
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b. The data given in Fig. 3 and Table 3 represent the relative fre¬ 
quencies found in a large volume of English telegraphic text of a govern¬ 
mental, administrative character.3 These frequencies will vary somewhat 
with the nature of the text analyzed.- For example, if an equal number 

of telegrams dealing solely with commercial transactions in the leather 
industry were studied statistically, the frequencies would be slightly 
different because of the repeated occurrence of words peculiar to that 
industry. Again, if an equal number of telegrams dealing solely with 
military messages of a tactical character were studied statistically, 
the frequencies would differ slightly from those found above for general 
governmental messages of,an administrative character. 

c. If ordinary English literary text (such as may be found in any 

book,"newspaper, or printed document) were analyzed, the frequencies of 
certain letters would be changed to an appreciable degree. This is 
because in telegraphic text words which are not strictly essential for 
intelligibility (such as the definite and indefinite articles, certain 
prepositions, conjunctions, and pronouns) axe omitted. In addition, 
certain essential words, such as ’'stop", "period", "comma", and the like, 
which are usually indicated in written or printed matter by symbols not 
easy to transmit telegraphically and which must, therefore, be spelled 
out in telegrams, occur very frequently. Furthermore, telegraphic text 
often employs longer and more uncommon words than does ordinary newspaper 
or book text, 1 • • £ - 

d. As a matter of fact, other tables compiled from Army sources 
gave slightly different results, depending upon the source of the text. 
For example, three tables based upon 75,000, 100,000, and 136,257 letters 
taken from various sources (telegrams, newspapers, magazine articles, 
books of fiction) gave as the relative order of frequency for the first 
10 letters the following: 

For 75,000 letters. ETRNIOASDL 

For 100,000 letters.E T R IE 0 AS D L 

For 136,257 letters. ETRNAOISLD 


3 Just as the individual letters constituting a large volume of plain 
text have more or less characteristic or fixed frequencies, so it is 
found that di graphs and trigraphs (two- and three-letter combinations, 
respectively)' have characteristic frequencies, when a large volume of 
text is studied statistically. In Table 6 of Appendix 2, "Letter fre¬ 
quency data - English", are shewn the relative frequencies of all digraphs 
appearing in the 260 telegrams referred to in subpar. 21e. This appendix 
also Includes several other kinds of tables and lists of”:frequency data 
which will be useful to the student in his work. It is suggested that 
the student refer to this appendix now, to gain an idea of the data 
available for his future reference. 

Other languages, of course, each have their own individual charac¬ 
teristic plaintext frequencies of single letters, digraphs, trigraphs, 
etc. A brief summary of the letter frequency data for German, French, 

I! 'JLion, Spanish, Portuguese, and Russian constitute Appendix 5, "Letter 
frequency data - foreign languages". 


RESTRICTED 


36 





REF ID:A56892 


- RESTRICTED 

e. Frequency data applicable purely to English military text were 
compiled by Hitt,^ from a study of 10,000 letters taken from orders and 
reports. The frequencies found by him are given in Tables 4 and 5 ♦ 

- - Table 4. —Frequency table for 10,000 letters of literary English, as compiled by Hitt 

* ALPHABETICALLY ARRANGED “ ’ 

A-_ 

B._ 


197 


E._ 1,277 

T._ 855 

0_ 807 

A_ 778 

N._ 680 

I- 667 


778 

a 

174 

L_ 

_ 872 


8 

141 

H_ 

595 

M._ 

_ 288 

R._ 

651 

296 

L_ 

667 

N..„. 

_ 086 

S._ 

622 

402 

J._ 

51 

0 

_ 807 

T 

855 

,277 

K._ 

74 

p...:. 

— 223 

uZLZ 

308 



ARRANGED ACCORDING TO FREQUENCY 


R._ 

651 

U._ 

- 308 


196 

K__ 

... '74 

u. 

S._ 

622 

C._ 

... 296 

w._ 

176 

J._ 51 

_ 

H.„ 

595 

M. 

... 288 

a 

174 

X 

27 


D_ 

402 

P„_ 

... 223 

B 

141 

7. 

17 

1 

L__ 

372 

F_ 

... 197 

v._ 

112 

Q._ 

... 8 

1 

1 


Table 5—Frequency table for 10,000 letters of telegraphic English, as compiled by Hitt 

ALPHABETICALLY'ARRANGED 


A_ 

813 

G.. 

_ 201 

L._ 

... 392 

0 ...... 

_ 38 

V... 

_ 136 

8 _ 

149 

H„ 

_I. 386 

M_ 

... 273 

R__ 

£77 

w... 

. 166 

c_ 

306 

I„ 

_ 711 

N._ 

... 718 

S._ 

_ 656 

X... 

_ 51 

D._ 

417 

J„ 

_ 42 

0 .. 

... 844 

T 

634 

Y.„ 

_208 

E_ 

1,319 

K_. 

. 88 

P_ 

243 

U 

321 

z... 

6 

F 

’205 












ARRANGED 

ACCORDING TO FREQUENCY 



E.„.. 

1 ,319 

S„ 

_ 656 

U.. 

.. 321 

F...... 

_ 205 

K~. 

_ 88 

0_ 

844 

T._ 

634 

C 

.. 306 

G 

....... 201 

Y 

_ 51 

A_ 

813 

D„ 

_ 417 

M.J_ 

.. 273 

W_ 

_ 166 

J_. 

_ 42 

N._ 

718 

L._ 

_ 392 

P._ 

.. 243 

B . 

_ 149 

Q— 

_ 38 

I _ 

711 

HL 

_ 386 

Y. _ 

.. 208 

V...... 

_ 136 

Z-... 

_ 6 

R. _ 

677 


- 

- 








23. Constancy of the standard or normal unlliteral frequency dis ¬ 
tribution .—a. The relative frequencies disclosed by the statistical 
study of large volumes of text may be considered to be the standard or 
normal frequencies of the letters of written English. Counts made of 
smaller volumes of text will xend to approximate these normal frequencies, 
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and, within certain limits,5 the smaller the volume, the lower will'he 
the degree of approximation to the normal, until, in the case of a very 
short message, the normal proportions may not manifest themselves at all. 
It is advisable that the student fix this fact firmly In mind, for ‘the 
sooner he realizes the true nature of any data relative to the frequency 
of occurrence of letters in text, the less often will his labors toward 
the solution of specific ciphers be thwarted and retarded by too strict 
an adherence to these generalized principles of frequency. He should 
constantly bear in mind that such data are merely statistical generaliza¬ 
tions, that they will be found to hold strictly true only in large Volumes 
of text, and that they may not even be approximated in short messages. 

b. Nevertheless the normal frequency distribution or the "normal 
expectation" for any alphabetic language is, in the last analysis, the 
best guide to, and the usual basis for, the solution of cryptograms of a 
certain type. It is useful, therefore, to reduce the normal, uniliteral 
frequency distribution to a basis that more or less closely approximates 
the volume of text which the cryptanalyst most often encounters in indi¬ 
vidual cryptograms. As regards length of messages, counting only the 
letters in the body, and excluding address and signature, a study of 
the 260 telegrams referred to in par. 21 shows that the arithmetical 
average is 217 letters; the statistical mean, or weighted average®, 
however, is 191 letters. These two results are, however, close enough 
together to warrant the statement that the average length of telegrams is 
approximately 200 letters. The frequencies given in par. 21 have there¬ 
fore been reduced to a basis of 200 letters, and the following unilite¬ 
ral frequency distribution may be taken as showing the most typical 
distribution to be expected in 200 letters of English telegraphic text:. 


S 



ABCDEFGHIJKLMNOP 


Figure h. 


>5 55 ss 

S 3? S 55 55 55 ^ 

QRSTUVWXYZ 


-^It ia useless to go beyond a certain limit in establishing the normal-frequency distribution for a given 

language. As a striking instance of this fact, witness the frequency study made by an indefatigable German, 
ICaeding, who in 1898 made a count of the letters in about 11,000,000 words, totaling about 62,000,000 letters in 
German text. When reduced to a percentage basis, and when the relative order of frequency was determined, 
the results he obtained differed very little from the results obtained by Kasiski, a German cryptographer, from a 
count of only 1,000 letters. See Kaeding, Haeufigkeitswoerterbuch, Steglitz, 1898; Kasiski, Die Geheimschriften 
und die Dechiffj ir-Kunst, Berlin, 1863. 

/■ I 

° The arithmetical average is obtained by adding each different length 
and dividing by the number of different-length messages; the mean is ob¬ 
tained by multiplying each different length by the number of messages of 
that length, adding all products, and dividing by the total number of 
messages. 
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c» The student should take careful note of the appearance of the 
distribution? shown in Fig, 4, for it will he of much assistance to him 
in the early stages of his study. The manner of Betting down the tallies 
should he followed hy him in' making his ovn distributions, indicating 
every fifth occurrence of a letter hy an oblique tally. This procedure 
almost automatically shows the total number of occurrences for each let¬ 
ter, and yet does not destroy the graphical appearance of the distribu¬ 
tion, especially if care is taken to use approximately the same amount 
of space for each set of five tallies. Cross-section paper is very 
useful for this purpose. 

d. The word "uniliteral" in the designation "uniliteral frequency 
distribution" means "single” letter", and it is to be inferred that other 
types of frequency distributions maybe encountered. For example, a 
distribution of pairs of letters, constituting a biliteral frequency 
distribution, is very often vised in the study of certain cryptograms in 
which it is desired that pairs made by combining successive letters be 
listed, A biliteral distribution of A B C D E F would take these pairs: 
AB, BC, CD, DE, EF. The distribution could be made in the form of a 
large square divided up into 676 cells. When distributions beyond bi- 
literal are required (triLiteral, quadriliteral, etc.) they can only be 
made by listing them in some order, for example, alphabetically based on 
the 1st, 2d, 3d, . . . letter. 


? The use of the terms "distribution" and "frequency distribution", 
instead of "table" and "frequency table", respectively, is considered 
advisable from the point of view of consistency with the usual statistical 
nomenclature. When data are given in tabular form, with frequencies 
indicated by numbers, then they may properly be said to be set out in 
the form of a table . When, however, the same data are distributed in a 
chart which partakes of the nature of a graph, with the data indicated 
by horizontal or vertical linear extensions, or by a curve connecting 
points corresponding to quantities, then it is more proper to call such 
a graphic representation of the data a distribution. 


\ 
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24. The three facts which can be determined from a study of the 
unllitoral frequency distribution for a cryptogram . —a. The following 
three facts (to he explained subsequently) can usually he determined from 
on inspection of the uniliteral frequency distribution for a given cipher 
message of average length, composed of letters: 

(1) Whether the cipher "belongs to the substitution or the transpo¬ 
sition class; 4 * 

(2) If to the former, Aether it is monoalphabetic® or non- 
monoalphabetic? in character; 

(3) If monoalphabetic, whether the cipher alphabet is standard 

(direct or reversed) or mixed. • 

b. For immediate purposes the first two of the foregoing determi¬ 
nations are quite important and will be discussed in detail in the next 
tiro paragraphs; the other determination will be touched upon very briefly, 
leaving its detailed discussion for subsequent sections of the text. 

25. Determining the class to which a cipher belong s. — a. The deter¬ 
mination of the class to which a cipher belongs is usually a relatively 
easy matter because of the fundamental difference between transposition 
and substitution os cryptographic processes. In a transposition cipher 
the original letters of the plain text have merely been rearranged, 
without any change whatsoever in their identities, that is, in the 
conventional values they have in the normal alphabet. Hence, the nuidbers 
of vowels (A, E, I, 0, U, Y), high-frequency consonants (D, H, R, S, T), 
medium-frequency consonants (B, C, F, G, H, L, M, P, V, W), and low- 
frequcncy consonants (J, K, Q, X, Z) are exactly the same in the 
cryptogram as they are in the plaintext message. Therefore, the 
percentages of vowels, high-, medium-, and low-frequency consonants are 
the scare in the transposed text as in the equivalent plain text. In a 

® In connection with uniliteral frequency distributions, the term 
monoalphabetic is considered to embrace the concept of monoalphabetic- 
monographic-uniliteral systems only, thus excluding polygraphic and 
multllitox’al systems, both of which, however, usually fall into the 
monoalphabetic category. 

9 The term non-monoalphabetic as applied in this Instance is consi¬ 
dered to embrace all deviations from the characteristic appearance of 
1 monoalphabetic distributions. These deviations include the phenomena 
inherent in polyalphobetic, polygraphic, and multiliteral cryptograms, 
as well as in random text, i.e,, text which appears to have been pro¬ 
duced by chance or accident, having no discernible patterns or 
limitations. 
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substittition cipher, on tlie other hand, the identities or the original 
Icfcbcro or the plain tc:rfc have teen changed, that is, the conventional 
values they have in the normal alphabet have been'altered. Consequently, 
if a count is mode of the various letters present in such a cryptogram, 
it ulll be found that the number of vowels, high-, medium-, and low- 
frequency consonants will usually be quite different in the cryptogram 
from what they are in the original plaintext message. Therefore, the 
percentages of vowels, high-, medium-, and low-frequency consonants are 
usually quite different in the substitution text from what they are in 
the equivalent plain text. Prom these considerations it .follows that if 
in a specific cryptogram the percentages of vowels, high-, medium-, and 
low-frequency consonants arc approximately the some as would be expected 
in normal plain text, the cryptogram probably belongs to the transposition 
class; if these percentages are quite different from those to be expected 
in normal plain text the cryptogram probably belongs to the substitution 
class. 

b. In the preceding subparagraph the'word "probably" was emphasized 
by italicizing it, for there can be no certainty in every case of this 
determination. Usually these percentages in a transposition cipher are 
close to the normal percentages for plain text; usually , in a substitu¬ 
tion cipher, they are far different from the normal percentages for plain 
text. But occasionally a cipher message is encountered which is difficult 
to classify with a reasonable degree of certainty because the message is 
too' short for the general principles of frequency to manifest themselves. 
It is clear that if in actual messages there were no variation whatever 
fTn the normal vowel and consonant percentages given in Tab.l e 3» the 
determination of the class to which a specific cryptogram belongs would 
be an extremely simple matter. But unfortunately there is always some 
variation or deviation from the normal. Intuition suggests that as 
messages decrease in length there may be a greater and greater departure 
from the normal proportions of vowels, high-, medium-, and low-frequency 
consonants, until in’very short messages the normal proportions may not 
hold nt all.' Similarly, as messages increase in length there may be a 
lesser and lesser departore from the normal proportions, until in messages 
totalling a thousand or more letters there may be no difference at all 
between the actual, and the theoretical proportions. But intuition is not 
enough, for in dealing vith specific messages of the length of those 
commonly encountered in practical work the question sometimes arises as 

to exactly how much deviation (from the normal proportions) may be allowed 
for in a cryptogram which shows a considerable amount of deviation from 
the normal and which might still belong to the transposition rather than 
to the substitution class. 

c. Statistical studies have been made on this matter and some graphs 
have been constructed thereon. These are shown in Charts 2 - 5 in. the 
form of simple curves, the use of which will now be explained. Each 
chart contains two curves marking the lower and upper limits, respect - 
ivelv. of the theoretical amount of deviation (from the normal percent¬ 
ages) of vowels or consonants which maybe allowable in a cipher believed 
to belong to the transposition class. 
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d. In Chart 2. curve Vq marks the lower limit of the theoretical 
amount of deviation^ 0 from the number of vowels theoretically expected to 
appear 3 -^ in a message of given length; curve Vg marks the upper limit 

of the same statistic. Thus, for example, in a message of 100 letters 
in plain English there should he between 33 and 47 vowels (A E I 0 U l). 
Likewise, in Chart 3 curves Hp and Hg mark the lower and upper limits as 
regards the high-frequency consonants. In a message of 100 letters-there 
should he between 28 and 42 high-frequency consonants (DURST). -In 
Chart 4 curves Mp and Mg mark the lower and upper limits as regards the 
medium-frequency consonants. In a message of 100 letters there should he 
between 17 and 31 medium-frequency consonants (BCFGHLMPVW). 
Finally, in Chart 5, curves Lp and Lg mark the lower and upper limits as 
regards the low-frequency consonants. In a message of 100 letters there 
should he between 0 and 3 low-frequency consonants (J K Q X Z) . In using 
the charts, therefore, one finds the point of intersection of the vertical 
coordinate corresponding to the length of the message, with the horizontal 
coordinate corresponding to (l) the number of vowels, (2) the number 
of high-frequency consonants, (3) the number of medium-frequency con¬ 
sonants, and'(4) the number of low-frequency consonants actually counted 
in the message. If all four points of intersection fall within the 
area delimited by the respective curves, then the numbers of vowels and 
high-, medium-, and low-frequency consonants correspond with the numbers 
theoretically expected in a normal plaintext message of the same length; 
since the message under investigation is not plain text, it follows that 
the cryptogram may certainly be classified as a transposition cipher. On 
the other hand, if one or more of these points of intersection fall out¬ 
side the area delimited by the respective curves, it follows that the 
cryptogram is probably a substitution cipher. The distance that the point 
of intersection falls outside the area delimited by these curves is a more 
or less rough measure of the improbability of the cryptogram's being a 
transposition cipher. 

e. Sometimes a cryptogram is encountered which is hard to classify 
with certainty even with the foregoing aids, because it has been con¬ 
sciously prepared with a view to making the classification difficult. 

This can be done either by selecting peculiar words (as in "trick crypto¬ 
grams") or by employing a cipher alphabet in which letters of approx ¬ 
imately similar normal frequencies have been interchanged. For example, 

E may be replaced by 0, T by R, and so on, thus yielding a cryptogram 
giving external indications of being a transposition cipher but which is 
really a substitution cipher. If the cryptogram is not too short, a close 
study trill ustially disclose what has been done, as well as the futility of 
so simple a subterfuge. 


In Charts 2-5, inclusive, the limits of the upper and lower curves 
have been calculated to include approximately 70 percent of messages of 
the various lengths,. _ 

The expression "the number of ... theoretically expected to appear" 
is often condensed to "the theoretical expectation of ..." or "the normal 
expectation of ..." 
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f. In the majority of cases, in practical work, the determination of 
the claso to which a cipher of average length belongs can he made from a 
mere inspection of the message, after the cryptanalyst has acquired a 
familiarity with the normal appearance of transposition and of substitu- 



' Chart 2. Curves marking the lover and upper limits of 
the theoretical amount of deviation from the number of vowels 
theoretically expected in messages of various lengths. 

(See subpar. 25d.) 


tion ciphers. In the former case, his eyes very speedily note many high- 
frequency letters, such as E, T, K, R, 0, and S, with the absence of 
low-frequency letters, such as J, K, Q, X, and Z; in the latter case, his 
‘'yes just as quickly note the presence of many low-frcquency letters, and 
n corresponding absence of some of the high-frequency letters. 
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g. Another rather quickly completed test, in the case of the' simpler 
varieties of ciphers, is to look for repetitions of groups of letters * As 
will become apparent very soon, recurrences of syllables, entire wprds and 
short phrases constitute a characteristic of all normal plain text;. Since 
a transposition cipher Involves a change ir the sequence of the letters 
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Chart 3. Curves marking the lower and upper limits of 
the theoretical amount of deviation from the number of high- 
frequency consonants theoretically ejected in messages of 
various lengths. (See subpar. 25d.) 


composing a plaintext message, such recurrences are broken up so that 
the cipher text no longer will show repetitions of more or less lengthy 
sequences of letters. But if a cipher message does show many repetitions 
and these are of several letters in length, say over four or five, the 
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Chart Ur. Curves marking the lower and upper limits of 
the theoretical amount of deviation from the number of medium- 
frequency consonants theoretically expected in messages of 
various lengths. (See subpar. 25 d.) 


medium-, and low-frequency consonants. Then, referring to Charts 2 to 5 , 
he should carefully note whether or not the observed frequencies for these 
categories of letters fall within the limits of the theoretical frequen¬ 
cies for a normal plaintext message of the same length, and he guided 
accordingly. 
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If 

h. It is obvious that the foregoing rule applies only to ciphers 
composed wholly of letters. If a message is composed entirely of figures, 
or of arbitrary signs and symbols, or of intermixtures of letter^J figures 
and other symbols, it Is immediately apparent that the cryptogram, is a 
substitution cipher. j! 



Number of letters in message. 

Chart 5. Curves marking the lower and upper limits of 
the theoretical amount of deviation from the number of low- 
frequency consonants theoretically expected in messages of 
various lengths. (See subpar. 25d.) 


1. Finally, it should be mentioned that there are certain kinds of 
cryptograms whose class cannot be determined by the method set forth in 
subparagraph d above. These exceptions will be discussed in a subsequent 
section of this text.!2 

Section X. 
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26 , Determining whether a substitution cipher is monoalphabetic or 
non-monoalphabetic .—a. It will be remembered that a monoalphabetlc 
substitution cipher is one in which a single cipher alphabet Is employed 
throughout the .:hole message; that is, a given plaintext letter is in¬ 
variably represented throughout the message by one and the same letter in 
the cipher text. On the other hand, a polyalphabetic substitution cipher 
is one in which two or more cipher alphabets are employed within the same 
message; that is, a given plaintext letter may be represented by two or 
more different letters in the cipher text, according to some rule govern¬ 
ing the selection of the equivalent to be used in each case. From this 
it follows that a single cipher letter may represent two or more different 
plaintext letters. A similar situation prevails in the case of multi¬ 
literal substitution, in which a particular cipher letter may constitute 
a part of the equivalents for several plaintext letters, giving rise to 
phenomena resembling those of polyalphabetieity. 

b. It is easy to see why and how the appearance of the uniliteral 
frequency distribution for a substitution cipher may be used to determine 
whether the cryptogram is monoalphabetlc or non-monoalphabetic in char¬ 
acter. The normal distribution presents marked crests and troughs by 
virtue of two circumstances. First, the elementary sounds which the 
symbols represent are used with greatly varying frequencies, it being one 
of the striking characteristics of every alphabetic language that its 
elementary sounds are used with greatly varying frequencies.^ in the 
Becond place, except for orthographic aberrations peculiar to certain 
languages (conspicuously, English and French), each such sound is rep¬ 
resented by the same symbol. It follows, therefore, that since in a 
monoalphabetic substitution cipher each different plaintext letter 
(selementary sound) is represented by one and only one cipher letter 
(^elementary symbol), the uniliteral frequency distribution for such a 
cipher message must also exhibit the irregular crest-and-trough appearance 
of the normal distribution, but with this important modification —the 
absolute positions of the crests and troughs will not be the same as in 
the normal . That is, the letters accompanying the crests and tho troughs 
in the distribution for the cryptogram will be different from those accom¬ 
panying the crests and the troughs in the normal distribution. But the 
marked irregularity or "roughness” of the distribution, that is, the 
presence of accentuated crests and troughs, is in itself an indication 
that each symbol or cipher letter always represents the same plaintext 
letter in that cryptogram. Hence the general rule: A marked crest-and - 
trough appearance in the uniliteral frequency distribution for a given 
cryptogram indicates that a single cipher alphabet is involved and 
constitutes one of the tests for a monoalphabetic substitution cipher . 

c. On the other hand, suppose that in a cryptogram each cipher 
letter represents several different plaintext letters. Some of them are 
of high frequency, others of low frequency. The net result of such a 


■*■3 The student who is interested in this phase of the subject may find 
the following reference of value: Zipf G.K. , Selected Studios of the 
Principle of Relative Frequency in Language , Cambridge, Mass., 1932. 
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situation, so far as the unlliteral frequency distribution for the 
cryptogram is concerned, is to prevent the appearance of any marked 
crests and troughs and to tend to reduce the elements of the distribution 
to a more or less common level. This imparts a "flattened out" appear¬ 
ance to the distribution. For example, in a certain cryptogram of 
polyalphabetic construction, K c =Ep, Gp, and Jpj R c =Ap, Dp, and Bpj 
Xc=0p, Lp, and Fp. The frequencies of Kc, Re, and X c will he approx¬ 
imately equal because the summations of the frequencies of the several 
plaintext letters each of these cipher letters represents at different 
times will be about equal. If this same phenomenon were, true of all the 
letters of the cryptogram, it is clear that the frequencies of the 2 6 
letters, when shown by means of the ordinary uniliteral frequency dist¬ 
ribution, would show no striking differences and the distribution would 
have the flat appearance of a typical polyalphabetic substitution cipher. 
Hence, the general rule: The absence of marked crests and troughs in the 
uniliteral frequency distribution indicates that a complex form of sub ¬ 
stitution is involved . The flattened-out appearance of the distribution , 
then, is one of the criteria for the rejection of a hypothesis of mono - 

alphabetic^ substitution . 

d. The foregoing test based upon the appearance of the frequency 
distribution is only one of several means of determing whether a sub¬ 
stitution cipher is monoalphabetic or non-monoalphabetic in composition. 

It can be employed in cases yielding frequency distributions from which 
definite conclusions can be drawn with more or less certainty by mere 
ocular examination. In those cases in which the frequency distributions 
contain insufficient data to permit drawing definite conclusions by such 
examination, certain statistical tests can be applied. One of these 
tests, called the <J> (phi) test, warrants detailed treatment and is 
discussed in paragraph 27 below. 

e. At this point, however, one additional test will be given be¬ 
cause of its simplicity of application. This test, the A (lambda) or 
blank-expectation test , may be employed in testing messages up to 200 
letters in length, it being assumed that in messages of greater length 
ocular examination of the frequency distribution offers little or no 
difficulty. This test concerns the number of blanks in the frequency 
distribution, that is, the number of letters of the alphabet which are 
entirely absent from the message. It has been found from statistical 
studies that rather definite "laws" govern the theoretically expected 
number of blanks in normal plaintext messages and in frequency distribu¬ 
tions for cryptograms of different natures and of various sizes. The 
results of certain of these studi js have been embodied in Chart 6. 

f. This chart contains two curves. The one labeled P applies to the 
average number of blanks theoretically expected in frequency distributions 
based upon normal plaintext messages of the indicated lengths. The other 
curve, labeled R, applies to the average number of blanks theoretically 
expected in frequency distributions based upon perfectly random assort¬ 
ments of letters; that is, assortments such as would be found by random 


l^CfTTfootnote 8 on page Ho. 
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selection of letters out of a hat containing thousands of letters, all of 
the 2 6 letters of the alphabet being present in equal proportions, each 
letter being replaced after a record of its selection has been made. Such 
random assortments correspond to polyalphabetic cipher messages in which 
the number of cipher alphabets is so large that if uniliteral frequency 
distributions are made of the letters, the distributions are practically 
identical with those which are obtained by random selections of letters 
cut of a hat. 


!■■■■■■■■■■■■■■■■ 
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Chart 6. Curves showing the average number of blanks 
theoretically expected in distributions for plain text (P) 
and for random text (R) for messages of various lengths. 
(See subpar. 26 f.) 


g. In using this chart, one finds the point of intersection of the 
vertical coordinate corresponding to the length of the message, with the 
horizontal coordinate corresponding to the observed number of blanks in 
the distribution for the message. If this point of intersection falls 
closer to curve P than it does to curve R, the number of blanks in the 
message approximates or corresponds more”closely to the number theoreti¬ 
cally expected in a plaintext message than it does to a random (ciphertext) 
message of the same length; therefore, this is evidence that the crypto¬ 
gram is monoalphabetic. Conversely, if this point of intersection falls 
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closer to curve R than to curve P, the number of blanks in the message ap¬ 
proximates or corresponds more closely to the number theoretically expected 
in a random text than it does to a plaintext message of the same length; 
therefore, this is evidence that the cryptogram is non-monoalphabetic. 

27. The <t> (phi) test for determining monoalphabeticity .—a. '.foie 
student has seen in the preceding paragraph how it is possible to deter¬ 
mine by ocular examination whether or not a substitution cipher is mono- 
alphabetic. This tentative determination is based on the presence.of a 
marked crest-and-trough appearance in the uniliteral frequency distribu¬ 
tion, and also on the number of blanks in the distribution. However, when 
the distribution contains a small number of elements, ocular examination 
and evaluation becomes increasingly difficult and uncertain. In such 
cases, recourse may be had to a mathematical test, known as the <f> test, 
to determine the relative monoalphabeticity or non-monoalphabeticity of 
a distribution. 

b. Without going into the theory of probability at this time, or 
into the derivation of the formulas involved, let it suffice for the pre¬ 
sent to state that with this test the "observed value of <J>" (symbolized 
by <|>o) is compared with the "expected value of $ random" (<)> r ) and the 
"expected value of <J> plain" (<#>p). The formulas are <J> r s.0385N(W-l) and, 
for English military text, <j>pr.0667N(N-l), where N is the total number of 
elements in the distribution.15 The use of these formulas is best illus¬ 
trated by an example. 

c. The following short cryptogram with its accompanying uniliteral 
frequency distribution is at hand: 

QCYCH ADSKS YZZQE CYKYK QZYSK 

LSZAC TKFCX LKLKC ESZMX KISZX 


= = _ __ g = _ s' jZ? _ N =50 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 


15 The constant .0385 is the decimal equivalent of 1 / 26 , i.e., the 
reciprocal of the number of elements in the alphabet. The constant .0667 
is the sum of the squares of the probabilities of occurrence of the indi¬ 
vidual letters in English plain text. These constants are treated in 
detail in Military Cryptanalysis, Part II. 
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<j> 0 is calculated "by applying the formula f(f-l) to the frequency (f) of 
each letter and'totaling the result; or, expressed in mathematical. 
notation, 1 ® (j> 0 =£f(f-l). Thus, 

JEf =2 6121 li 831 3 61 356 = 50 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 

if(f-l) S 2 30020 0 0 5660 6 300 62030=188 

For this distribution, <|>r= .0385N(K-l) = .0385 x 50 x = 9 k, and 

<f>p= .O 667 R (W-l)=.0667 x 50 x 49 = 163 . 

3Jow since <j> 0 , 188 , is in fact greater than 4>p, we have a mathematical 
corroboration of the hypothesis that the cryptogram is a monoalphabetic 
substitution cipher. If were nearer to <j> r , then the assumption would 
be that the cryptogram is not a monoalphabetic cipher. If c {> 0 were just 
half way between <Ji r and <}>p, then decision would have to be suspended, 
since no further statistical proof in the matter is possible with this 
particular test. 1 ? 

d. 1 Two further examples may be illustrated: 


(l) ABC DEFGHIJKLMNOPQRSTUVWXYZ ? 

0 026122 0 3220 0 6 <f(f-l)s42 


'16 rpke more usual mathematical notation for expressing would be 
Z 

E fi(fi-l), which is read as "the sum. of all the terms for all integral 
1=A Z 

values of f from A to Z inclusive. In turn, E fi(fi-l) would be expanded 

i=A 

as fA(fA~l)* ’**£c(f‘C“ 1 ) + . • • • .tfz(fz-l) • However, in the 

interest of simplicity the notation 4 . f(f-l) is employed; likewise, the 
notations fa* and <f>p are employed in lieu of the more usual E($r) and E(<j>p). 

•*•? Another method of determining the relative monoalphabetic ity of a 
cryptogram is based upon comparing the index of coincidence (abbr. I.C .) 
of the cryptogram under examination with the theoretical I.C. of plain 
text;. The I.C. of jt message is defined as the ratio of $0 to 4x>; thus, 


in the example above, the I.C. is 


*“95' 


which equals 2. The theoretical 


I.C. of English plain text is 1.73, which is the decimal equivalent of 
* * . • • 

the ratio of the “plain constant" to the "random constant". The 
.0385 ■*- - 

I.C. of random t«xt is 1, l.e., .038$ . 

70355 
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_ _ _ _ - __ _ * s N«25 

(2) ABCDEFGHIJKLMNOPQRSTUVWXYZ 

0 0 0 2 0 0 0.6 0 0 02 00 0 0*2 6 £f(f-l)=l8 

Since both distributions have 25 elements, then for both 

<i>r « .0385 x 25 x 24 = 21, and 

4>p » .0667 x 25 x 24 = 40. 

Hence distribution (l) is monoalphabetic, while ( 2 ) is not« 

e. The student must not assume that statistical tests in cryptanal¬ 
ysis are infallible or absolute in themselves^®* statistical approaches 
serve only as a means to the end, in guiding the analyst to the most 
probably fruitful sources of attack. Since no one test in cryptanalysis 
gives definite proof of a hypothesis (in fact, not even a battery of tests 
gives absolute proof), all applicable statistical means at the disposal 
of the cryptanalyst should be used; thus, in examination for monoalphabet- 
icity, the <j> test, A test, and even other tests!9 could profitably be 
employed. To illustrate this point, if the <f> test is taken on the 
distribution of the plaintext letters of.the phrase 

A QUICK BROWN FOX JUMPS OVER THE LAZY DOG 


•»_=_= =_N=33 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 
2 2 12 2 2 £f(f-l)«20 

<j>j* s 4l; <j)p s 70 

it will be noticed that <j> 0 Is less than half of <f> r , thus conclusively 
"proving" that the letters of this phrase could not possibly constitute 
plain text nor a monoalphabet ic encipherment of plain text in any lan¬ 
guage i The student should be able to understand the cause of this 
cryptologic curiosity. 

18 The following quotation from the Indian mathematician P. C, Maha- 
lanobis, concerning the fallibility of statistics, is particularly 
appropriate in this connection: "If statistical theory is right, predic¬ 
tions must sometimes come out wrong; on the other hand, if predictions 
are always right, then the statistical theory must be wrong."—Sankhya, 
Vol. 10, Part 3, p. 203. Calcutta, 1950. 

■*-9 One of these, the chi-square test , will be treated in a subsequent 
text. 
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28, Determining whether a cipher alphabet is standard (direct or 
reversed) or mixed .--a. Assuming that the uniliteral frequency distri¬ 
bution for a given cryptogram has been made, and that it shows clearly 
that the cryptogram is a substitution cipher and is monoalphabetic in 
character, a consideration of the nature of standard cipher alphabets 2 ^ 
almost makes it obvious how an inspection of the distribution will dis¬ 
close whether the cipher alphabet involved is a standard cipher alphabet 
or a mixed cipher alphabet. If the crests and troughs of the distribu¬ 
tion occupy positions which correspond to the relative positions they 
occupy in the normal frequency distribution, then the cipher alphabet 
is a standard cipher alphabet. If this is not the case, then it is 
highly probable that the cryptogram has been prepared by the use of a 
mixed cipher alphabet. A mechanical test may he applied in doubtful 
cases arising from lack of material available for study; «3ust what this 
test involves, and an illustration of its application will he given in 
the next section, using specific examples. 

b. Of course, if it has been determined that a standard cipher al¬ 
phabet is involved in a particular instance, it goes without saying that 
at the same time it must have "been found whether the alphabet is a direct 
standard or reversed standard cipher alphabet. The difference between the 
distribution of a direct standard alphabet cipher and one of a reversed 
standard alphabet cipher is merely a matter of the direction in which the 
sequence of crests and troughs progresses—to the right, as is done in 
normally reading or writing the alphabet (ABC Z), or to the left, 
that is, in the reversed direction (Z fr, C B A). With a direct standard 
cipher alphabet the direction in which the crests and troughs of the dis¬ 
tribution progress is the normal direction, from left to right; with a 
reversed standard cipher alphabet this direction is reversed, from right 
to left. 


20 See par. 12. 
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SECTION V 

UNILITERAL SUBSTITUTION WITH STANDARD CIPHER ALPHABETS 


Paragraph 

Types of standard cipher alphabets................ 29 

Procedure in enciphermant and decipherment by means of 

unlllteral substitution. .... 30 

Principles of solution by construction and analysis of the 

uniliteral frequency distribution.31 

Theoretical example of solution..••«.*«.... 32 

Practical example of solution by the frequency method............... 33 

Solution by completing the plain-component sequence................. 3^+ 

Special remarks on the method of solution by completing the 

plain-component sequence .. 35 

Value of mechanical solution as a short cut....... 36 

Basic reason for the low degree of cryptosecurity afforded by 

monoalphabetic cryptograms involving standard cipher alphabets... 37 

29* Types of standard cipher alphabets .—a. Standard cipher 
alphabets are of two types: 

(1) Direct standard , in which the cipher component is the normal 
sequence but shifted to the right or left of its point of coincidence in 
the normal, alphabet. Example: 

--►_ 

Plain: ABCIEFGHIJKII^NOPQPSTUVWXYZ 

Cipher: QRSTUWrXYZABCDEFQHIJKLMNOP 

-v 

It is obvious that the cipher component can be applied to the plain 
component at any one of 26 points of coincidence, but since the alphabet 
that results from one of these applications coincides exactly with the 
normal alphabet, a series of only 25 (direct standard) cipher alphabets 
results from the shifting of the cipher component. 

(2) Reversed standard , in which the cipher component is also the 
normal sequence but runs in the opposite direction from the normal. 
Example: 


Plain: ABCDEFGHIJKLMNOPQRSTUVWXYZ 

Cipher: QPOIJMLKJIHGFEDCBAZraJVUTSR 

< - 

Here the cipher component can be applied to the plain component at any 
of 26 points of coincidence, each yielding a different cipher alphabet, 
There is in thie case, therefore, a series of 26 (reversed standard) 
cipher alphabets. 
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b. It is often convenient to refer to or designate one of a series 
of cipher alphabets without ambiguity or circumlocution. The usual me¬ 
thod is to indicate the particular alphabet to which reference 1 b made by 
citing a pair of equivalents in that alphabet, such as, in the example 
above, ApsQ c . The key for the cipher alphabet just referred to, as 

well as that preceding it, is ApsQc, it is said that the key letter 
for the cipher alphabet is Qc» 

c. The cipher alphabet in subpar. a(2), above, is also a reciprocal 
alphabet ? that is, the cipher alphabet contains 13 distinct pairs of equiv¬ 
alents which are reversible. For example, in the alphabet referred to, 
ApsQc and Qp=Ac? BpsP c and Pp=B c , etc. The reciprocity exists through¬ 
out the alphabet and is a result of the method by which it was formed. 
(Reciprocal alphabets may be produced by juxtaposing any two components 
which are identical but progress in opposite directions.’) 

30. Procedure in encipherment and decipherment by means of uni ¬ 
literal substitution. —a. When a message is enciphered by means of 
unlllteral substitution , or simple substitution (as it is often called), 
the individual letters of the message text are replaced by the single¬ 
letter equivalents taken from the cipher alphabet selected by prearrange¬ 
ment. Example: 

Message: EIGHTEEN PRISONERS CAPTURE 1 } 

Enciphering alphabet: Direct standard, Ap=T c 

Plain: ABCDEFGHIJKIMNOPQRSTCmfXY/, 

Cipher: TUVWXYZABCDEFGHIJKLMNOPQRS 

Letter-for-letter encipherment: 

EIGHTEEN PRISONERS CAPTURED 
XBZAMXXG IKBLHGXKL VTIMNKXW 

The cipher text is then regrouped, for transmission, into groups of five. 
Cryptogram: 

XBZAM XXGHC BLHGX KLVTI MNKXW 

b. The procedure in decipherment is merely the reverse of that in 
encipherment • The cipher alphabet selected by prearrangemenb is set up 
with the cipher component arranged in the normal sequence and placed above 
the plain component for ease in deciphering. The letters of the crypto¬ 
gram are then replaced by their plaintext equivalents, as shown below. 

Cipher: ABCDEFGHIJKLMOPQRSTUVWXXZ 
Plain: HIJmfflOPQRSTUVWXYZABCDEFG 

The message deciphers thus: 

Cipher: XBZAM XXGRC BLHGX KLVTI MKKXtf 
Plain: EIGHT EENPR ISONE HSCAP TUBED 

The deciphering clerk rewrites the text in word lengths: 

EIGHTEEN PRISONERS CAPTURED 
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c, In subpar, a, above, the ciyptogram was prepared In final form 
for transmission by dividing the cryptographic text into groups of five* 
This is generally the case in military communications Involving cipher 
systems • It promotes acc racy in telegraphic transmission since an opera¬ 
tor knows he must receive definite number of characters in each group, 
no more and no less. Also, re /ually makes solution of the messages by 
unauthorized persons more difficult because the length of the words, 
phrases, and sentences of the plain text is hidden* If the last group of 
the cipher text in subpar. 30 a had not been a complete group of five 
letters, it might have been completed by adding a sufficient nuniber of 
meaningless letters (ctilled nulls ) * 

31* Principles of solution by construction and analysis of the 
unlliteral frequency distribution *—-a. The analysis of monoalphabetic 
cryptograms prepared by the use of standard cipher alphabets follows al¬ 
most directly from a consideration of the liature of such alphabets. Since 
the cipher component of a standard cipher alphabet consists either of the 
normal sequence merely displaced 1, 2, 3* • • • Intervals from the normal 
point of coincidence, or of the normal sequence proceeding in a reversed- 
normal direction, it is obvious that the unlliteral frequency distribution 
for a cryptogram prepared by means of such a cipher alphabet employed 
monoalphabetically will show crests and troughs whose relative positions 
and frequencies will be exactly the same as in the uniliteral frequency 
distribution for the plain text of that cryptogram. The only thing that 
has happened is that the whole set of crests and troughs of the distribu¬ 
tion has been displaced to the right or left of the position it occupies 
in the distribution for the plain text; or else the successive elements of 
the whole set progress in the opposite direction. .Hencey it follows that' 
the correct determination of the plaintext value of the cipher letter 
marking any crest or trough of the uniliteral frequency distribution, 
coupled with the correct determination of the relative direction in which 
the plain component sequence progresses, will result at one stroke in the 
correct determination of the plaintext values of all the remaining 25 
letters respectively marking the other crests and troughs in that dis¬ 
tribution. The problem thus resolves itself into a matter of selecting 
that point of attack which will most quickly or most easily lead to the 
determination of the value of one cipher letter. The single word 
identification will hereafter be used for the phrase "determination of 
the value of a cipher letter"; to identify a cipher letter is to find 
its plaintext value. 

b. It is obvious that the easiest point of attack is to assume that 
the letter marking the crest of greatest frequency in the frequency 
distribution for the cryptogram represents Ep* Proceeding from this 
initial point, the identifications of the remaining cipher letters marking 
the other crests and troughs are tentatively made on the basis that the 
letters of the cipher component proceed in accordance with the normal 


> 
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alphabetic sequence, either direct or reversed* If the actual frequency 
of each letter marking a crest or a trough approximates to a fairly close 
degree the normal or theoretical frequency of the assumed plaintext 
equivalent, then the initial identification ©c=Ep may he assumed to he 
correct and therefore the derived identifications of the other cipher 
letters also may he assumed to he correct.3- If the original starting 
point for assignment of plaintext values is not correct, or if the direc¬ 
tion of “reading" the successive crests and troughs of the distribution 
is not correct, then the frequencies of the other 25 cipher letters will 
not correspond to or even approximate the normal or theoretical frequencies 
of their hypothetical plaintext equivalents on the basis of the initial 
identification* A new initial point, that is, a different cipher equiv¬ 
alent, must then he selected to represent Ep; or else the direction of 
"reading" the crests and troughs must he reversed. This procedure, that 
Is, the attempt to make the actual frequency relations exhibited hy the 
uniliteral frequency distribution for a given cryptogram conform to the 
theoretical frequency relations of the normal frequency distribution in an 
effort to solve the cryptogram. Is referred to technically as "fitting the 
actual uniliteral frequency distribution for a cryptogram to the theoret¬ 
ical uniliteral frequency distribution for normal plain text", or, more 
briefly, as "fitting the frequency distribution for the c r yptogram to the 
normal frequency distribution '*, or, still more briefly,' fitting the dis¬ 
tribution! to the normal ." In statistical work the expression commonly 
employed in connection with this process of fitting an actual distribu¬ 
tion to a theoretical one is "testing the goodness of fit." The goodness 
of fit may he stated in various ways, mathematical in character.2 

£« In fitting the actual distribution to the normal, it is neces¬ 
sary to regard the cipher comporent (that is, the letters A . . , Z marM/ng 
the successive crests and troughs of the distribution) as partaking of 
the nature of a circle, that is, a sequence closing in upon itself, so that 
no matter with what crest or trough one starts, the spatial and frequency 
relations of the crests and troughs are constant. This manner of regard¬ 
ing the cipher component as being cyclic in nature is valid because it 
is obvious that the relative positions and frequencies of the crests and 
troughs of any uniliteral frequency distribution must remain the same 
regardless of what letter is employed as the initial point of the distri ¬ 
bution . Fig* 5 gives a clear picture of what is meant in this connect¬ 
ion, as applied to the normal frequency distribution. 


-*■ The Greek letter & (theta) is used to represent a character or 
letter without indicating its identity. Thus, instead of the circum¬ 
locution "any letter of the plain text", the symbol ©p is used; and for 
the expression "any letter of the cipher text", the symbol 0 C is used. 

2 one of these tests for expressing the goodness of fit, the X (chi) 
test, will be treated in Military Cryptanalysis, Part II. 
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A B C D E F G H f J U J i O ? Q E B : U V H y Z A F C D E F 
-» -•• -> 


...FEDCBAZYXWVUTSRQPONMLKJIHGFEDCBA 

«- 4 - 4 - 

Figure 5* 


d# In the third sentence of subparagraph b, the phrase "assumed to 
he correct" was advisedly employed In describing the results of the at¬ 
tempt to fit the distribution to the normal, because the final test of 
the goodness of fit in this connection (that is, of the correctness of 
the assignment of values to the crests and troughs of the distribution) 
is whether the consistent substitution of the plaintext values of the ci¬ 
pher characters in the cryptogram will yield intelligible plain text* If 
this is not the case, then no matter how close the approximation between 
actual and theoretical frequencies is, no matter how well the actual fre¬ 
quency distribution fits the normal, the only possible inferences are 
that (l) either the closeness of the fit is a pure coincidence in this 
case and that another equally good fit may be obtained from the same data, 
or else (2) the cryptogram involves something more than simple monoalpha- 
betic substitution by means of a single standard cipher alphabet# For 
example, suppose a transposition has been applied in addition to the 
substitution* Then, although an excellent correspondence between the 
uniliteral frequency distribution and the normal frequency distribution 
has been obtained, the substitution of the cipher letters by their assumed 
equivalents will still not yield plain text. However, aside from such 
cases of double encipherment, instances in which the uniliteral frequency 
distribution may be easily fitted to the normal frequency distribution 
and in which at the same time an attempted simple substitution falls to 
yield intelligible text are rare. It may be said that, in practical 
operations whenever the uniliteral frequency distribution can be made to 
fit the normal frequency distribution, substitution of values will result 
in solution; and, as a corollary, whenever the uniliteral frequency 
distribution cannot be made to fit the normal frequency distribution, 
the cryptogram does not represent a case of simple, monoalphabetic 
substitution by means of a standard alphabet# 
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32* Theoretical example of solution .--a. The foregoing principles 
will "become clearer "by noting the encryption and solution of a theoretical 
example. The following message is to "be encrypted. 

HOSTILE FORCE ESTIMATED AT OWE REGIMENT INFANTRY AND TWO PLATOONS 
CAVALRY MOVING SOUTH ON QUINNIMONT PIKE STOP HEAD OF COLUMN NEARING ROAD 
JUNCTION SEVEN THREE SEVEN COMMA EAST OF GREENACHE SCHOOL FIRED UPON BY 
OUR PATROLS STOP HAVE DESTROYED BRIDGE OVER INDIAN CREEK. 

b. First, solely for purposes of demonstrating certain principles, 
the unillteral frequency distribution for this plaintext message is 
presented in Figure 6. 


8 =8 
- 8 8 8 - =5 

8 8 8 88 8%8 
8 S§8 — 8 s: I I ^ 8 8 8 — — 

8ss8888888-s=88888-88888- 1 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 

Figure 6 . 


a. Now let the foregoing message be encrypted monoalphabetically by 
the following standard cipher alphabet, yielding the cryptogram shorn 
below and the frequency distribution shorn in Figure 7. 


Plain ----ABCDEFGHIJKLMNOPQRSTUVWXYZ 
Cipher - - -GHIJKLMNOPQRSTUVWXYZABCDEF 

Plain - - - HOSTI LEFOE CEEST BIATE DATON EREGI MEDTI NFANT RYAND 

Cipher - - NUYZO RKLUX IKKYZ OSGZK JGZUT IIXKMO SKTZO TLGTZ XEGTJ 

Plain-TWOPL ATOON SCAVA LRYMO VINGS OTJTHO NQUHf NB10N TPIKE 

Cipher - - ZCUVR GZUUT YIGBG RXESU BOTMY UAZNU TWAOT TOSUT ZVOQK 

Plain-STOPH EADOF COLUM NIIEAR INGRO ADJUN CTION SEVEN THREE 

Cipher - - YZUVN KGJUL IURAS TTKGX OTMXU GJPAT IZOUT YKBKT ZNXKK 

Plain-- SEVEN COMM. EASTO FGREE S/UlBB SCHOO IFIRE DUPON BYOUR 

Cipher - - YKBKT IUSSG KGYZU IMXKK TGIXK YINUU RLOXK JAVUT HEUAX 

Plain-PATRO LSSTO PHAVE DESTR OYEDB RIDGE OVERI NDIAN CREEK 

Cipher - - VGZXU RYYZU VNGBK JKYZX UEKJH XOJMK UBIIXO TJOGT IXKKQ 


Cryptogram 


NUYZO 
SKTZO 
RXESU 
YZUVN 
IZOUT 
L M X K K 
VGZXU 
U B K X 0 


RKLUX 

TLGTZ 

BOTMY 

KGJUL 

YKBKT 

TGIXIC 

RYYZU 

TJOGT 


IKKYZ 
XEGTJ 
U A Z H U 
IURAS 
ZNXKK 
YINUU 
VNGBK 
IXKKQ 


OSGZK 
ZCUVR 
TWAOT 
TTKGX 
YKBKT 
RLOXK 
J II Y Z X 


JGZUT 

GZUUT 

TOSUT 

OTMXU 

IUSSG 

JAVUT 

UEKJH 


KXKMO 
YIGBG 
Z V 0 Q K 
GJPAT 
KGYZU 
HEUAX 
X 0 J II K 
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JC: as -c 

g g 5 g g g = g 

g 5§g - g » =5 g g ^ ggg 

gg» § g5;ggggggg^^ggggg^ggg 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 

' ' Figure 7» 


d« Let the student now compare Figs. 6 and 7# "which have teen su¬ 
perimposed in Fig. 8 for convenience in examination. Crests and troughs 
are present in "both dis brlbutions; moreover their relative positions and 
frequencies have not been changed in the slightest particular. Only the 
absolute position of the sequence as a whole has "been displaced six 
places to the right in Fig. J, as compared with the absolute position of 
the sequence in Fig. 6, 


I B 

K S 


gssggggggg^sslgggg^ggggg^ - § 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 

t 

O.I.Z.^ 

is sS 


g g _ § gisggggggg-ssggggg^gg 

ABCDEFGHIJ KLMNOPQRSTUVWXY 


TU __Q 


e. If the two distributions are compared in detail the student will 
clearly understand how easy the solution of the cryptogram would be to one 
who knew nothing about how it was prepared. For example, the frequency of 
the highest crest, representing Ep in Fig. 6 is 28 ; at an interval of four 
letters before Ep there is another crest representing Ap with frequency 
1 6 . Between A and E there is a trough, representing the medium-frequency 
letters B, C, D. On the other side of E, at an interval of four letters, 
comes another crest, representing I with frequency 14. Between E and I 
« there is another trough, representing the medium-frequency letters F, G, II 
Compare these crests and troughs with their homologous crests and troughs 
in Fig. 7. In the latter, the letter K marks the highest crest in the 
distribution with a frequency of 28; four letters before K there is an¬ 
other crest, frequency l6, and four letters on the other side of K there 
is another crest, frequency l4. Troughs corresponding to B, C, D and F, 

G, H are seen at H, I, J and L, M, N in Fig. 7* In fact, the two dis¬ 
tributions may be made to coincide exactly, by shifting the frequency 
distribution for the cryptogram six places to the left with respect to 
the distribution for the equivalent plaintext message, as shorn herewith. 
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s 2 = 2 

B - 2 II v i 

gg 2 § 22 2 = 2 

& 2 5 § 2 -2 '«. % I I v I I I s - 

2 = 2 2 2 2 2 2 2 - = 2 2 2 2 2 - 2 2 2 2 2 - 2 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 


3 


£ 


2 = 2222222 -=; 
GHIJKLMNOPQRS 


2 2 
2 2 
2 2 


2 = 2 
2 2 2 — — 
222 - 22222 - = 
TUVWXYZABCDEF 


Figure 9« 


f. Let us suppose now that nothing is known about the process*of 
encryption, and that only the cryptogram and its uniliteral frequency 
distribution is at hand. It is clear that simply bearing in mind the 
spatial relations of the crests and troughs in a normal frequency dis¬ 
tribution would enable the cryptanalyst to fit the distribution to the 
normal in this case. He would naturally first assume that K c sEp, from 
which it would follow that if a direct standard alphabet is involved, 
L c sFp,Mc=Gp, and so on, yielding the following (tentative) deciphering 
alphabet: 


Cipher - - -ABCDEFGHIJKLMNOPQRSTUVWXYZ 

Plain- UVWXYZABCDEFGHIJKLMNOFQRST 

* * 

(I ^ * 

jg. Now comes the final test i tf these assumed values are substi¬ 
tuted in the cipher text, the plain texfc immediately appears. Thus: 

NUYZO RKLUX IKKYZ OSGZK JGZUT etc. 

HOSTI LEFOR CEEST I MATE DATON etc. 

h. It should be clear, therefore, lhat the initial selection of 
Gc as the specific key (that is, to represent Ap) in the process of 
encryption has absolutely no effect upon the relative spatial and 
frequency relations of the crests and troughs of the frequency distri¬ 
bution for the cryptogram. If Qc had been selected to represent Ap, 
these relations would still remain the same, the whole series of crests 
and troughs being merely displaced further to the right of the positions 
they occupy when Gc-Ap, 

33« Practical example of solution by the frequency method .-- 
a. The ease of direct standard~alnhabeb ciphers .-- (!) The following 
cryptogram is to be solved by applying the foregoing principles: 


N W N V H 
W B N J B 


C A X X Y 
C X 0 W N 


B J C C J 
F C X W B 


L T R W P 
C X Y Y N 


X D A Y X 
C II A B L 


B R C R X 
X U R U 0 
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(2) From the presence of so man/ low-frequency letters such as B, 

W, and X it is at once suspected that this is a substitution cipher. But 
to illustrate the steps, that must he taken in difficult cases in order to 
he certain in this respect, a uniliteral frequency distribution is con¬ 
structed, and then reference is r^fte to Charts 2 to 5 to note whether the 
actual numbers of vowels, high-, end low-frequency consonants 

fall inside or outside the areas -5''limited by the respective curves. 


_ _ s * % = - =_ 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 

Figure 10 a. 


Letters 

Frequency 

Position with respect to areas 
delimited by curves 

Vowels (A F! I 0 U Y) L ____ . ~ — - - - 

‘ 10 
12 
26 
12 

Outside, chart 1. 

Outside, chart 2. 

Outside, chart 3. 

Outside, chart 4. 

Highey CWiAonfintp (D R 5? T) .. ... __ 

Medium-frequency Consonants (BCFGHLMPV W). 

TjOW-f r eqiien«y Onriaonn/ntfl i( J K Q X 7.) 

Total__ , „ _ _ - __ _ _ _ 

60 




(3) All four points falling completely outside the areas delimited by 
the curves applicable to these four classes of letters, the cryptogram is 
clearly a substitution cipher. 

(4) The appearance of the frequency distribution, with marked crests 
and troughs, indicates that the cryptogram is probably monoalphabetic. At 
this point the <f> test is applied to the distribution. The observed value 
of $ is found jfco be 258 , while the expected value of <i> plain and <[> random 
are calculated to be 236 and 136 , respectively. The fact that the ob¬ 
served value is not only closer to but greater than <j>p is taken as 
statistical evidence that the cryptogram is monoalphabetic. Furthermore, 
reference being made to Chart 6 , the point of intersection of the message 
length (60 letters) and the number of blanks (8) falls directly on 

curve P; this is additional evidence that the message is probably mono- 
alphabetic . 

(5) The next step is to dete rmin e whether a standard or a mixed 
cipher alphabet is involved. This is done by studying the positions and 
the sequence of crests and troughs in the frequency distribution, and 
trying to fit the distribution to the normal. 
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(6} The first assumption to be made is that a direct standard cipher 
alphabet is involved# The highest crest in the distribution occurs over 
X(ji Let it be assumed that Xq*®p# Then Yq, Zq, Aq, • • • # Gp^ 

H p , . . . ., respectively; thus: 


sM- - _ a = a_ 

Cipher*...A BCDEFGHIJKLMNOPQRSTUVWXYZ 
Plain#....H IJKLMNOPQRSTUVWXYZABCDEFG 

Figure 10b. 

tf 

It may be seen quickly that the approximation to the expected frequencies 
is very poor. There are too many occurrences of J p , Qp, Up and Fp and 
too few occurrences of N p , Op, R p , S p , T p and A p . Moreover, if a substi¬ 
tution is attempted on this basis, the following is obtained for the first 
two cipher groups: 

Cipher.N WNVHCAXXY 

"Plain texfc*U DUCO JHEEF 

This is certainly not plain text and it seems clear that X c is not E_, if 
the hypothesis of a direct standard alphabet cipher is correct. A p 
different assumption will have to be made. 

(7) Suppose Cc=Ep. Going through the same steps as before, again 
no satisfactory results are obtained. Further trials^ are made along the 
same lines, until the assumption N c sEp is tested: 


sgj_ _ _ sr _ =? _ __ =_ggg 

Cipher....A BCDEFGHIJKLM SO PQRSTUVWXYZ 
Plain. RSTUVWXYZABCDEFGHIJKLMNOPQ 

Figure 10c. 

(8) The fit in this case is quite good; possibly there are too few 
occurrences of Ap, Dp, and Rp. But the final test remains: trial of the 
substitution alphabet on the cryptogram itself. This is done and the 
results are as follows: 


C: 

N W N V H 

C A X X Y 

B J C C J 

L T R W P 

X D A Y X 

B R C 

R 

X 

P: 

ENEMY 

TROOP 

SAHA 

C K I N G 

0 U R P 0 

SIT 

I 

0 

C: 

W B N J B 

C X 0 W N 

F C X W B 

C X Y Y N 

CNABL 

X U R 

W 

0 

P: 

N S E A S 

T 0 F N E 

¥ T 0 N S 

T 0 P P E 

T E R S C 

0 L I 

N 

F 


ENEMI TROOPS ATTACKING OUR POSITIONS EAST OF NEWTON. PETERS COL INF. 


3 It is unnecessary, of course, to write out all the alphabets and 
pseudo-decipherments, as shown above, when testing assumptions. This is 
usually done mentally. 
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(9) It lo always advisable to note the specific key. In this case 
the eori'espondence between any plaintext letter and its cipher equivalent 
will Indicate the key* Although other conventions ore possible, end 
equally valid,’ it is usual, however, to indicate the key by noting the 
cipher equivalent of Ap. In this case ApSj c , 

b. The case of reversed standard alphabet ciphers .---(l) Let the 
following cryptogram and its frequency distribution be studied, 

F W F X L QSVVU R J Q Q J H Z B W D VPSUV RBQBV 

WRFJR QVEWF IT Q V W R QVUUF QFSEH V Y B W E 

(2) The preliminary steps illustrated above, under subpar, a (l) 

to (4) inclusive, in connection with the test for class and Eonoolphabot- 
icity, will here be omitted, since they arc exactly the same In nature • 
The result is that the cryptogram is obviously a substitution cipher and 
is monoalphabetic, 

(3) Assuming that It is not known whether a direct or a reversed 
standard alphabet is involved, attempts are at once made to fit the fre¬ 
quency distribution to the normal direct sequence • If the student vill 
try them he will soon find out that those are unsuccessful. All this 
takes but a few minutes. 

(4) The next logical assumption is now made, via ., that the cipher 
alphabet is a reversed standard alphabet. When on this basis F c is 
assumed to be Ep, the distribution con readily be fitted to the normal, 
practically every crest and trough in the actual distribution correspond¬ 
ing to a crest or trough in the expected distribution. 


Cipher....A BCDEFGHIJKLMNO P S T u9 W X Y Z 

Plain. JIHGFEDCBAZYXWVUTSRQPOKMLK 

Figure lOd. 

( 5 ) When the substitution is made in the cryptogram, the following 
is obtained. 

Cryptogram...F W.FXL QSVVU RJQQJ 
Plain text. ..E NEMY TROOP SATTA 

(6) The plaintext; message is identical with that in subpar. a. The 
specific key in this case is also Ap=J c . If the student vill compare the 
frequency distributions in the two cases, he vill note that the relative 
positions and extents of the crests and troughs are identical; they 
merely progress in opposite directions. 
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c. General note on solution by the frequency method .--In actual 
practice, the procedure of subpars. a and b are given a more rapid treat¬ 
ment than that just described, the practical treatment being based, not 
on the initial finding of some single crest or trough, but rather on locat¬ 
ing the more readily-discernible clusters of crests which usually appear 
in a distributio n, su ch as the distinctive crest-patterns representing 
"A. ,,E».,1" and "RST". These crest-patterns are searched for, with a 
quick scanning of the distribution, and then the relative placement with 
respect to each other is tested to see if it conforms to the expectation 
for a direct standard cipher alphabet, and, if not, then for a reversed 
standard cipher alphabet. During this latter step, which consists "of 
little more than counting in one direction and then (when necessary) in 
the other, the blank (or near ly-blari k) expectation of "uK" followed by 
the characteristic curve for "Lf-itfOP” and the blank "Q" are considered, 
as a means of either substantiating or invalidating the original 
"identification” of the crests, 

3^, Solution by completing the plain-component sequence ,— 
a. The case of direeb standard alphabet cipher s',—(l) The foregoing 
method of analysis, involving as it does the construction of a uniliteral 
frequency distribution, was termed a solution by the frequency method 
because it involves the construction of a frequency distribution and its 
study. There is, however, another method which is much more rapid, almost 
wholly mechanical, and which, moreover, does not necessitate the con¬ 
struction or study of any frequency distribution vhatever. An under¬ 
standing of the method follows from a consideration of the method of 
encipherment of a message by the use of a single, direct standard cipher 
alphabet, 

( 2 ) Note the following encipherment: 

Message-TWO CRUISERS SUEK 

Enciphering Alphabet 

Plain- ABCDEFGHIJKLMNOPQRSTUVWXYZ 

Cipher—— GHIJKLMEOPQRSTUVWXYZABCDEF 

Encipherment 

Plain text-TWO CRUISERS SUITE 

Cryptogram-ZCU IXAOYKXY YATQ 

Cryptogram 

ZCUIX AOYKX Y Y* A T Q 

(3) The enciphering alphabet shc^m above represents a case wherein 
the sequence of Reuters of both components of the cipher alphabet is the 
normal sequence, with the sequence fomi •>_" the cipher component v.erely 
shifted six places to the left (or 20 p. sit ices to the right 1 ) of the 
position it occupies in the normal aiph'bet. If, therefore, two strips 
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of paper "bearing the letters of the normal sequence, equally spaced, are 
regarded aa the two components of the cipher alphabet and are juxtaposed 
at all of the 2J> possible points of coincidence, it is obvious that one 
of those 25 juxtapositions must correspond to the. actual juxtaposition 
shown in the enciphering alphahc* .directly above « It is equally obvious 
that if a record were kept of the results obtained by applying the values 
given at each juxtaposition to the letters of the cryptogram, one of these 
results would yield the plain text of the cryptogram. 

(4) Let the work be systematized and the results set down in an 
orderly manner for examination. It is obviously ■unnecessary to juxtapose 
the two components so that AcsAp, for on the assumption of a direct stan¬ 
dard alphabet, juxtaposing two direct normal components at their normal 
point of coincidence msi.'ely yields .plain text. The next possible juxta¬ 
position, therefore, is Ac=Bp. Let the juxtaposition of the two sliding 
strips therefore be A c =Bp, as shown here; 


Plain- ABCDEFGHIJIOMOPQPSTUV^-naZ 

C ipher-ABCDEFGniJK^IWOPQBSTlJVWXyZABCIMFGHIJKIMHOPCJPSTtMJXIZ 


The values given by this juxtaposition are substituted for the letters 
of the cryptogram and the following results are obtained. 

Cryptogram- ZCUIX AOYKX YYATQ 

1st Test— ,r Flain “text" ADVJY BPZLY ZZBUR 

This certainly is not intelligible text; obviously, the two components 
were not in the position indicated in this first test. The plain 
component is therefore slid one interval to the left, making Ac=Cp, and 
a second test is made. Thus 

Plain- ABCDEFGHIJKLMNOPQRSTUVWXYZ 

C ipher- ABCDEFGHIJFCLMOPQPSTUVWXYZABCDEFGHI JEOMKTOPQpSTUVWXYZ 

Cryptogram- ZCUIX AOYKX YYATQ 

2d Test— ."Plain text’ 1 B E W K Z C Q A M Z A A C V S 


Neither does the second test result in disclosing any plain text. But, 
if the results of the two tests are studied a phenomenon that at first 
seems quite puzzling comes to light. Thus, suppose the results of the 
two tests are superimposed in this fashion. 

Cryptogram- ZCUIX AOYKX YYATQ 

1st Test--"Plain text* 1 ADVJY B P Z L Y ZZBUR 

2a Test-“Plain text” BEWKZ CQAMZ AACVS 

^ One of the strips should bear the sequence repeated. This permits 
juxtaposing the two sequences at all 26 possible points of coincidence 
so as to have a complete cipher alphabet showing at all times. 
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(!?) Note what has happened* The net result of the two experiments 
was merely to continue the normal sequence "begun "by the cipher letters at 
the heads of the columns of letters. It is obvious that if the normal 
sequence is completed in each column the results will be exactly the same 
as though the whole set of 25 possible tests had actually been performed . 
Let the columns therefore be completed, as shorn in Fig. 11. 

ZCUIXAOYKXYYATQ 
ADV JYBPZLYZZBUR 
B E W K Z C QAMZAACV S 
CFXLADRBNABBDWT 
DGYMBESCOBCCEXU 
EHZNCFTDPCDDFYV 
FIAODGUEQDEEGZW 
. GJBPEIIVFREFFHAX 
HKCQFIWGSFGGIBY 
ILDRGJXHTGHHJCZ 
JMESHKYIUHIIKDA 
KNFTILZJVIJJLEB 
LOGUJMAKNJKKMFC 
MPHVKNBLXKLLNGD 
NQINLOCMYLMMOIIE 
ORJXMPDNZMNNPIF 
PSKYNQEOANOOQJG 
QTLZORFPBOPPRKH 
RUMAPSGQCPQQSLI 
SVNBQTHRDQRRTMJ 
*TWOCRUISERSSUNK 
UXPDSVJTFSTTVOL 
VYQETWKUGTUUWPM 
WZRFUXLVHUVVXQN 
X A S G V Y M W I V W W Y R 0 
YBTEWZNXJWXXZSP 

Figure 11. 

An examination of the successive horizontal lines of the diagram discloses 
one and only one line of plain text, that marked by the asterisk and read¬ 
ing T W 0 C RU I S E R S S U UK. 

(6) Since each column in Fig. 11 is nothing but a normal sequence, 
it is obvious that instead of laboriously writing down these columns of 
letters every time a cryptogram is to be examined, it would be more con¬ 
venient to prepare a set of strips each bearing the normal sequence 
doubled (to permit complete coincidence for an entire alphabet at any set¬ 
ting), and have them available for examining any future cryptograms. In 
using such a set of sliding strips in order to solve a cryptogram prepared 
by means of a single direct standard cipher alphabet, or to mke a test to 
determine whether a cryptogram has been so prepared, it is only necessary 
to "set up" the letters of the cryptogram on the strips, that is, align 
them in a single rear across the strips (by sliding the individual strips 
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up or down). The successivc; horizontal lines, called generatrices (sin¬ 
gular, generatrix )5, are then examined in a scorch for intelligible text. 

If the cryptogram roolly "belongs to this single type of cipher, one of 
the Generatrices will exhibit intelligible text .all the tray across; this 
text trill practically invariably be the plain text of the message. This 
method of analysis may be termed a jolut '.on by completing the plain *- 
component sequence . Sometimes it is referred to as "running down*'~the 
sequence. The principle upon which the method is based constitutes one 
of the cryptanalyst's most valuable tools« 

b. The case of reversed standard alphabets (l) The method describ¬ 
ed under Bubpar. a may also be applied, in slightly modified form, in the 
case of a cryptogram enciphered by a single reversed standard alphabet. 

The basic principles are identical in the two cases, as will now be demon¬ 
strated. 

(2) Let two sliding components be prepared as before, except that in 
this case one of the components must be a reversed normal sequence, the 
other, a direct normal sequence. 

(3) Let the two components be juxtaposed A to A, as shotm below, 
and then let the resultant values be substituted for the letters of the 
cryptogram. Thus: 


CRYPTOGRAM 

IT K S 13 P MYOCP OOMTW 

Plain- ABCDEMJHIJKLMNOPQRSTU^'JXIZ 

C ipher- ZYKSTVTJTSRQPOM/IIKJinGFElXrBAZYXi-rTTJrSRQPOITMLKJIHGFEDCBA 

Cryptogram- HKSEP MYOCP OOMTW 

1st Test—"Plain text 1 ' IT Q I W L 0 C M Y L M M 0 H E 

(h) This does not yield intelligible text, and therefore the revers¬ 
ed component is slid one space forward and a second test is made. Thus; 

Plain- ABCDEFGHIJ3KXMHOPQRSTU3ATXYZ 

Cipher-ZYXWVUTSRQPOmteJIHGFEDCBAZmrVin!SR(^OI®'IUGTIHGFEDCBA 

Cryptogram-■'-N K £ E ° MYOCP OOMTW 

2d Test—"Plain text" 0 R J X M P D IT Z M IT IT P I F- 

(5) Neither does the second,'test yield intelligible text. But let 
the results of the two tests be superimposed. Thus: 

Cryptogram- NKSEP MYOCP OOMTW 

1st Test—"Plain text' 1 N Q I W L 0 C M Y L M M OTe 

2d Test-"Plain text" ORJXM PDNZM N IT P I F 


^ Pronounced: j <*n x er-a-tri^sez and jen^er-a^trxks , respectively. 

^ A set of heavy paper strips, suitable for use in completing the 
plain-component sequence, has been prepared for use as a training aid in 
connection with the courses in Military Cryptanalysis. 
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( 6 ) It is seen that the letters *'f t'i« ’’plain text” given hy the 
second trial are merely the continuants of the normal sequences Initiated 
hy the letters of the "plain text" given “by the first trial. If these 
sequences are "run down"—that is, completed within the columns—the 
results must obviously he the same as though successive tests exactly 
similar to the first two were applied to the cryptogram, using one 
reversed normal and one direct normal component. If the cryptogram has 
really been prepared by means of a single reversed standard alphabet, 

one of the generatrices of the diagram that results from completing the 
sequences must yield intelligible text. 

(7) Let the diagram be made, or better yet, if the student has 
already at hand the set of sliding strips referred to in footnote 6 to 
page 69 , let him "set up" the letters given by the first trial* Fig* 12 
shows the diagram and indicates the plaintext generatrix. 

NKSEPMYOC POOMTW 
NQmOCMYlHMOHE 
ORJXMPDNZMNNPIF 
PSKYHQEOANOOQJG 
QTLZORFPBOPPRKH 
RUMAPSGQCPQQSLI 
SVNBQTHRDQRRTMJ 
*TWOCRUISERSSUIIK 
UXPDSV JTFSTTVOL 
VYQETWKUGTUUWPM 
WZRFUXLVHUVVXQH 
XASGV»YMW IVWWYRO 
YBTHWZNXJWXXZSP 
ZCUIXAOYKXYYATQ 
A D V JYBPZLYZZBUR 
BEWKZC QAMZAACV S 
CFXLADRBNABBDWT 
DGYMBESCOBCCEXU 
EHZNCFTDPCBDFYV 
FIAODGUEQDEEGZW 
GJBPEHVFREFFHAX 
HKCQFIWGSFGGIBY 
ILDRGJXHTGHHJCZ 
JMESHKYIUHIIKDA 
KHFTILZJVI JJ1EB 
L 0 G U J MiA KW JKKMFC 
MPHVKNBLXKLLNGD 

Figure 12. 

(8) The only difference in procedure between this case and the 
preceding one (where the cipher alphabet vas a direct standard alphabet) 
is that the letters of the cipher text a,be first "deciphered" by means 
of any reversed standard alphabet and then the columns are "run do’tm", 
according to the normal ABC . « . Z sequence. For reasons which will 
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■become apparent very soon, the first step in this method is technically 
termed converting the cipher letters into their plain-component equiva ¬ 
lents ; the second step is the same as before, viz,, completing the plain - 
component sequence « 

35. Special remarks on the method of solution by completing the 
plain-component sequence ,--a. The terms employed to designate the steps 
in the solution set forth in par, 3^b(8), viz ,, "converting the cipher 
letters into their plain-component equivalents" and "completing the plain- 
component sequence", accurately describe the process. Their meaning will 
become more clear as the sbudent progresses with the work. It may be said 
that whenever the components of a cipher alphabet are known sequences, no 
matter how they are composed, the difficulty and time required to solve 
any cryptogram involving the use of those components is considerably re¬ 
duced, In some cases this knowledge facilitates, and in other cases is 
the only thing that makes possible, the solution of a very short cryptogram 
that might otherwise defy solution . Later on an example will be given to 
illustrate what is meant in this regard, 

b. The student should take note, however, of two qualifying expres¬ 
sions that were employed .Ln a preceding paragraph to describe the results 
of the application of the method. It was stated that "one of the gener¬ 
atrices will exhibit intelligible text all the way across ; this text will 
practically invariably be the plain text’."” Will there ever be a case in 
which more than one generatrix will yield intelligible text through its 
extent? That obviously depends almost entirely on the nuiriber of letters 
that are aligned to form a generatrix. If a generatrix contains hut a 
very few letters, only five, for example, it may happen as a result of 
pure chance that there will be two or more generatrices shewing what 
might be "intelligible text." Note in Fig. 11, for example, that there 
are several cases in which 3-letter and 4-letter English words (LAD, COB, 
MESH, MAPS, etc.) appear on generatrices that are not correct, these 
words being formed by pure chance. But there is not a single case, in this 
diagram, of a 5-letter or longer word appearing fortuitously, because 
obviously the longer the word the smaller the probability of its appear¬ 
ance purely by chance; and the probability that two generatrices of 15 
letters each will both yield intelligible text along their entire length 

is exceedingly remote, so remote, in fact, that in practical cryptology 
such a case may be considered nonexistent .7 

c. The student should observe that in reality there is no difference 
whatsoever in principle between the two methods presented in subpars. a 
and b of par, 34» Xu the former the preliminary step of converting the 
cipher letters into their plain-component equivalents is apparently not 
present hut in reality it is there. The reason for its apparent absence 
is that in that case the plain component of the cipher alphabet is ident¬ 
ical in all respects vith the cipher component, so that the cipher letters 


^ A person trith patience and an inclination toward the curiosities of 
the science might construct a text of 15 or more letters which would yield 
two "intelligible" texts on the plain-component completion diagram. 
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require no conversion, or, rather, they are identical with the equivalents 
that would result if they were converted on the basis A c sAp, In fact, if 
the solution process had “been arbitrarily initiated by converting the ci¬ 
pher letters into their plain-component equivalents at the setting A<>=0p, 
for example, and the cipher component slid one interval to the right 
thereafter, the results of the first and second tests of par. 34a would 
be as follows: 

Cryptogram-. ZCUIXAOYKXYYATQ 

1st Test—"Plain text 11 — N Q I W L 0 C MYL MMOH E 

2d Test—"Plain text’*— OR JXMPDUZMOPIF 

Thus, the foregoing diagram duplicates in every particular the diagram 
restating from the first two tests under par. 34b: a first line of cipher 
letters, a second line of letters derived from them but showing externally 
no rel <.,ionship with the first line, and a third line derived immediately 
from the. second line by continuing the direct normal sequence." TM« '^Oint 
is brought to attention only for the purpose of showing that a sirvxe, 
broad principle is the basis of the general method of solution by complet¬ 
ing the plain-component sequence, and once the student has this ’.’irmly in 
mind he will have no difficulty whatsoever in realizing whe?. principle 
is applicable, what a powerful cryptanalytic tool it can be, and what 
results he may expect from its application in specific instances. 

d. In the two foregoing examples of the application of the prin¬ 
ciple, the components were normal sequences; but it should be clear to 
the student, if he has grasped"what has been said in the preceding sub- 
paragraph, that these components may be mixed sequences which, if known 
(that is, if the sequence of letters comprising the sequences is known to 
the cryptanalyst), can be handled Just as readily as can components that 
are normal sequences. 

e. It is entirely immaterial at what points the plain and the cipher 
components are juxtaposed in the preliminary step of converting the cipher 
letters into their plain-component equivalents. For example, in the case 
of the reversed alphabet cipher solved in par. 3^b, the two components 
were arbitrarily juxtaposed to give the value Ap=Ac, but they might have 
been juxtaposed at any of the other 25 possible points of coincidence 
without in any way affecting the final result, viz ., the production of 
one plaintext generatrix in the completion diagram. 

36. Value of mechanical solution as a short cut .—a. It is evident 
that the very first step the student should take in his attempts to solve 
an unknown cryptogram that is obviously a substitution cipher is to try 
the mechanical method of solution by completing the plain-component se¬ 
quence, using the normal alphabet, first direct, then reversed. This 
hakes only a very few minutes and is conclusive in its results. It saves 
the labor and trouble of constructing a frequency distribution in case 
the cipher is of this simple type. Later on it will be seen how certain 
variations of this simple type may also be solved by the application of 
this method. Thus, a very easy short cut to solution is afforded, which 
even the experienced cryptanalyst never overlooks in his first attack on 
an unknown cipher. 
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b. It Is Important now to note that if neither of the tvo foregoing 
attempts is success Pul in bringing plain text to light and tho cryptogram 
is quite obviously monoalphabetic in character, the cryptanalyst is war - 
ranted, in assuming bhab the cryptogram involves a mixed cipher alphabet « 

37. Basic reason for the low degree of ; yptosocurity afforded by 
monoalphabetic cryptograms involving standard, cipher alphabets .-"The 
student has seen .that the solution of monoalphabetic cryptograms involving 
standard cipher alphabets is a very easy matter. Two methods of analysis 
were described, one involving the construction of a frequency distribu¬ 
tion, the other not requiring this kind of tabulation, being almost mech¬ 
anical in nature and correspondingly rapid. In the first of these two 
methods it was necessary to make a correct assumption as to the value of 
but one of the 26 letters of the cipher alphabet and the values of the 
remaining 25 letters at once became known; in the second method it was not 
necessary to assume "Value for even a single cipher letter. The student 
should understand what constitutes the basis of this situation, viz ., the 
fact that the two components of the cipher alphabet are composed of known 
sequences . 'What if one or both of these components are, for the crypt¬ 
analyst, unknown sequences ? In other words, what difficulties will con¬ 
front the cryptanalyst if the cipher component of the cipher alphabet is 
a mixed sequence? Will such an alphabet be solvable as a whole at one 
stroke, or will it be necessary to solve its values individually? Since 
the determination of the value of one cipher letter in this case gives no 
direct clues to the value of any other letter, it would seem that the 
solution of such a cipher should involve considerably more analysis and 
experiment than has the solution of either of the two types of ciphers so 
far examined. The steps to be taken in the cryptanalysis of a mixed- 
alphabet cipher will be discussed in the next section. 

^ There is but one other possibility, already referred to under 
subpar. 31d which involves the case where transposition and monoalphabetic 
substitution processes have been applied in successive steps. This is 
unusual, however, and will be discussed in its proper place. 
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SECTION VI 


UNILITERAL SUBSTITUTION WITH MIXED CIPHER ALPHABETS 


r Paragraph 

Literal keys and numerical keys33 
Types of mixed cipher alphabets........ 39 

Additional remarks on cipher alphabets. 4-0 

Preliminary steps in the analysis of a monoalphabetic, mixed- 

alphabet cryptogram.... 4l 

Preparation of tbe work sheet...... 4-2 

Triliteral frequency distributions. 4-3 

Classifying the cipher letters into vowels and consonants.. 44 

Further analysis of the letters representing vowels and consonants.. 4-5 
Substituting deduced values in the cryptogram....................... 4-6 

Completing the solution. 47 

General remarks on the foregoing solution..............48 

The "probable-word 11 method; its value and applicability. 4-9 

Solution of additional cryptograms produced by the same 

components.. $0 

Derivation of key words...... 51 


38» Literal keys and numerical keys .—a. As has been previously 
mentioned, most cryptosystems involve the use of a specific key to con¬ 
trol the steps followed in encrypting or decrypting a specific message 
(see subpar. 9b). Such a key may be in literal form or in numerical form* 

b. It is convenient to designate a key which is composed of letters 
as a literal key . As already mentioned, a literal key may consist of a 
single letter, a single word, a phrase, a sentence, a whole paragraph, or 
even a book; and, of course, it may consist merely of a sequence of let¬ 
ters chosen at random. 

c. Certain cryptosystems involve the use of a numerical key , which 
may consist of a relatively long sequence of numbers difficult or impos¬ 
sible for the average cipher clerk to memorize. Several simple methods 
for deriving such sequences from words, phrases, or sentences have been 
devised, aiid a numerical key produced by any of these methods is called a 
derived numerical key (as opposed to a key consisting of randomly-selected 
numbers). One of the commonly-used methods consists of assigning numer¬ 
ical values to the letters of a selected literal key in accordance with 
their relative positions in the ordinary alphabet, as exemplified in the 
following subparagraph. 
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d. Let the prearranged key word "be the word LOGISTICS. Since C, 
the penultimate letter of the key word, appears in the normal alphabet 
before any other letter of the key word, it is assigned the number 1: 

LOGISTICS 

1 

The next letter of the normal alphabet that occurs in the key word is G, 
which is assigned the number 2. The letter I, which occurs twice in the 
key word, is assigned the number 3 for its first occurrence and the num¬ 
ber 4* for its second occurrence; and so on. The final result is; 

LOGISTICS 

562379418 

This method of assigning the numbers is very flexible and varies with 
different vises to which numerical keys are put. It may, of course, be 
applied to phrases or to sentences, so that a very long numerical key, 
ordinarly impossible to remember, may be thus derived at will from an 
easily-remembered key text . 

e. As far as the cryptanalyst is concerned, the derivation of a 

• numerical key from a specific literal key is of interest to him because 
this knowledge may assist in subsequent solutions of cryptograms prepared 
according to the same basic system, or in identifying the source from 
which the literal key was selected - perhaps an ordinary book, a magazine, 
etc. However, it should he pointed out that in some instances the crypt¬ 
analyst may he unaware that a literal key has In fact been used as the 
basis for deriving a numerical key. 

39» Types of mixed cipher alphabets .—a. It will be recalled that 
in a mixed cipher alphabet the sequence of letters or characters in one 
of the components (usually the cipher component) does not correspond to 
the normal sequence. There are various methods of composing the sequence 
of letters or elements of this mixed component, and those which are based 
upon a scheme that is systematic in its nature are very useful because 
they make possible the derivation of one or more mixed sequences from any 
easily-remembered word or phrase, and thus do not necessitate the carry¬ 
ing of m’itten memoranda. Alphabets involving a systematic method of 
mixing are called systematically-mixed cipher alphabets . 

h. One of the simplest types of systematically-mixed cipher alpha¬ 
bets Is the keyword-mixed alphabet . The cipher component consists of a 
key word or phrase (with repeated”letters, if present, omitted alter 
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their first occurrence)- 1 -, followed by the letters of the alphabet in 
their normal sequence (with letters already occurring in the key omitted 
of course). Example, with GOVERNMENT as the key word; 

Plain; ABCDEFGHIJKI2®0PQPSTUVWXTZ 
C ipher; GOVERNOTABCDFHIJKLPQSUWXXZ 

c. It is possible to disarrange the sequence constituting the 
cipher component even more thoroughly by applying a simple method of 
transposition to the keyword-mixed sequence. Two common methods are 
illustrated below, using the key word TELEPHONY. 

(1) Simple columnar transposition ; 

TELPHONY 
ABCDFGIJ 
K M Q"R S tt V W 
X Z 

Mixed sequence (formed by transcribing the successive columns from left 
to right): 

TAKXEBMZLC QPDRHFSOGUNIVYJW 

(2) Numerically-keyed‘columnar transposition : 

' - 7-1-3-6-2-5-4-8 

TELPHONY 
ABCDFGIJ 
KMQRSUVW 
X Z 

Mixed sequence (formed by transcribing the columns In a sequence deter¬ 
mined by the numerical key derived from the key word itself): 

EBMZHFSLCQNIVOGUPDRTMXYJW 


1 

Mixed alphabets formed by including all repeated letters of the 
key word or key phrase in the cipher component were common in Edgar 
Allan Poe's day but are impractical because they are ambiguous, making 
decipherment difficult; an example: 


(a) Alphabet for enciphering.— 

(b) inverse form of (a), 

for deciphering.—- 


Plain: ABCDEFGHIJXIMX)PQRSTUVNXYZ 

Cipher: NOWISTHETIMEFORALLGOODMENT 

Cipher: ABCDEFGHIJKXMIOPQPSTtFWXIZ 
Plain: P VHMSGD QKAB OEF C 

L J MIN I 

X T Z 

tr 


The average cipher clerk would have considerable difficulty in decrypting 
a cipher group such as TOOET, each letter of which has three or more 
equivalents, and from which the plaintext fragments (N)lNTH,, ..IT THI(S), 
IT THI..., etc. can be formed on decipherment. 
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a. The last two systematically-mixed sequences are examples of 
transposition-mixed sequences . Almost any method of transposition may he 
used to produce such sequences* 

e. Another simple method of forming a mixed sequence is the deci ¬ 
mation method * In this method, letters in the normal alphabet, or in a 
keyword-mixed sequence, are "counted off" according to any selected In¬ 
terval. As each letter is decimated--that is, eliminated from the basic 
sequence by counting off—it is entered in a separate list to form the 
new mixed sequence* For example, to form a mixed sequence by this xnethod| 
from a keyword-mixed sequence based on the key phrase SING A SONG OF 
SIXPENCE with 7 the interval selected, proceed as follows : 

Keyword-mixed (or basic) sequence: 

SINGA0FXPECBDHJK1MQPTUWYZ 

'When the letters are counted off by 7’s from left to right, F will be the 
first letter arrived at, H the second, T the third: 

5 INGAO^XPECBDjfjKLMQR^UVWYZ 

123456J123456J1234567 

These letters are entered in a separate list (F first, H second, T third, 
and so on) and eliminated from the keyword-mixed sequence. When the 
end of the keyword-mixed sequence is reached, return to the beginning, 
skipping the letters already eliminated: 

SjfNGAO?XP?iCBDjfJKL#QR!jfUVWYZ 

123^5 

6 J 1 2 3 ^ 5 6 J l 2 3 b 5 6 7 

The decimation-mixed sequence: 

FHPmiZPQNIWCn/BSIXiGQKYJOT 

f. Practical considerations, of course, set a limit to the complex¬ 
ities that may be introduced in constructing systematically-mixed alpha¬ 
bets. Beyond a certain point there is no object in further mixing. The 
•greatest amount of mixing by systematic processes will give no more se¬ 
curity than that resulting from mixing the alphabet by random selection, 
such as by putting the 26 letters in a box, thoroughly'shaking them up, 
and then drawing the letters out one at a time* Whenever the laws of 

• chance operate in the construction of a mixed alphabet, the probability 
of producing a thorough disarrangement of letters is very great. Random - 
mixed alphabets give more cryptographic security than do the less compli¬ 
cated systematically-mixed alphabets, because they afford no clues to 
positions of letters, given the position of a few of them. Their chief 
disadvantage is that they must be reduced to writing, since they cannot 
readily be remembered, nor can they be reproduced at will from an easily- 
remembered key word. 
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Up, Additional remarks on cipher alphabets ,—a. All cipher alpha- 
"bets may be classified on the basis of their arrangement as enciphering 
or deciphering alphabets . An enciphering alphabet is one in vhich the 
sequence of letters in the plain component coincides with the normal 
sequence and is arranged in that manner for convenience in encipherment. 
In a deciphering alphabet the sequence of letters in the cipher component 
coincides with the normal, for convenience in deciphering. For example, 
(l), below, shows a mixed cipher alphabet arranged as an enciphering al¬ 
phabet; (2j shows the corresponding deciphering alphabet. An enciphering 
alphabet and its corresponding deciphering alphabet present an inverse 
relationship to each other. To invert a deciphering alphabet is to write 
the corresponding enciphering alphabet; to invert an enciphering alphabet 
is to write the corresponding deciphering alphabet. 

Enciphering Alphabet 

Plain: ;®CDEFGHIJKLMNOPQRSTUVWXYZ 

'■ 1 ' Cipher: .JKQVXZWESTRMJIOLGAPHCMYBDF 

Deciphering Alphabet 

, Cipher: ABCDEFGHIJiaMfOPQPSTUVWXYZ 
( 2 > Plain: RXUYHZQTNABrVLOSCKIJMDGEWF 

b. A series of related reciprocal alphabets maybe produced by 
juxtaposing at all possible points of coincidence two components which 
are identica!, but progress in opposite directions. This holds regardless 
of whether tlie components are composed of an even or an odd nuxnber of 
elements. The following reciprocal alphabet is one of such a series of 
26 alphabets: 

Plain: 'l^^dJLICBEFGJKMOPQGTVWXZ 

- Cipher: GFEBCILUARDYHZXWTSQpOHMKJ 

A Bingle or isolated reciprocal alphabet may be produced in one of two 
ways: 

(1) By constructing a complete reciprocal alphabet by arbitrary or 

random assignments of values in pairs. That is, if Ap is made the 
equivalent of K c , then Kp is made the equivalent of A,.; if Bp is made R c , 
then Rp is made B c , and so on. If the two components thus constructed 
are slid against each other no additional reciprocal alphabets will be 
produced. " * 

(2) By juxtaposing a sequence comprising an even number of elements 
against the same sequence shifted exactly half way to the right (or left), 
as seen below: 


ABCDEFGHIJKLMNOPQRSTUVWXyZ, 

ABCDEFGHIJKLMHOPQRSTOVWXyZABCDEFGHIJKIMIOPQRSTUVWXYZ 
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hi. Preliminary steps In the analysis of a monoalphabetic, mixed - 
alphabet cryptogram .—a. The student is now ready to resume his crypt- 
analytic studies. Note the following cryptogram: 

SFDZF IOGHL PZFGZ DYSFF HBZDS GVHTF UPLVD FGYVJ VFVBT GADZZ AITYD ZYFZJ 

ZTGFT VTZBD VFHTZ DFXSB GIDZY VTXOI YVTEF VMGZZ THLLV XZDFM HPZAI TYDZY 

BDVFH TZDFK ZDZZJ SXISG ZYGAV FSLGZ DTHHT CDZRS VTYZD OZFFH TZA3T YDZYG 

AVDGZ ZTK HI TYZ YS DZGHG ZFZTG UPGDI XWGHX ASR UZ DFU TD EGHTV EAGXX 

b. A casual inspection of the text discloses the presence of sev¬ 
eral long repetitions as well as of many letters of normally low fre¬ 
quency, such as F, G, V, X, and Z; on the other hand, letters of normally 
high frequency, such as the vowels, and the consonants N and R, are rel¬ 
atively scarce. The cryptogram is obviously a substitution cipher and 
the usual mechanical tests for determining whether it is possibly of the 
monoalphabetic, standard-alphabet type are applied. The results being 
negative, a uniliteral frequency distribution is immediately constructed, 
as shown in Figure 13, and the <j> test is applied to it. _ 



ABCDEFGHIJKLMNOPQRSTUVWXYZ 

8 4 1 23 3 19 19 IS 10 3 2 5 2 0 3 t 0 2 10 22 S 1® 1 8 14 35 

<j>p=3668 <t> r =2117 4»o=3862 

Figure 13. 

c. The fact that the frequency distribution shows very marked 
crests and troughs indicates that the cryptogram is very probably mono- 
alphabetic, and the results of the 4> test further support this hypothesis. 
The fact that the cryptogram has already been tested hy the method of 
completing the plain-component sequence and found not to he of the mono- 
alphahetic, standard-alphabet type, indicates with a high degree of 
probability that it involves a mixed cipher alphabet. A few moments might 
be devoted to making a careful inspection of the distribution to insure 
thab it cannot be made to fit the normal; the object of this would be to 
rule out the possibility that the text resulting from substitution by a 
standard cipher alphabet had not subsequently been transposed. But this 
inspection in this case is hardly necessary, in view of the presence of 
long repetitions in the message. 2 (See subpar. 25g.) 

^ This possible sbep is mentioned here for the purpose of making it 
clear tlxat the plain-component sequence completion method cannot solve a 
case in which transposition has followed or preceded monoalphabetic sub¬ 
stitution with standard alphabets. Cases of this kind will he discussed 
in a later texb. It is sufficient to indicate at this point that the - 
frequency distribution for such a combined substitution-transposition 
cipher would present the characteristics of a standard alphabet cipher 
and yet the method of completing the plain-component sequence would fail 
to bring oub any plain text* 



' .REF ID:A56092 - 

RESTRICTED . ‘ . 

d. One might, of course, attempt to solve the cryptogram hy applying 
the simple principles of frequency. One might, in other words, assume 
that Z c (the letter of greatest frequency) represents Ep, D c (the letter 
of next greatest frequency) represents Tp, and so on. If the message 
were long enough this sin pie procedure might more or less quickly give 
the solution. But the message is relatively short and many difficulties 
would he encountered. Much time and effort would he expended unnecessar¬ 
ily, because it is hardly to he expected that in a message of only 235 
letters the relative order of frequency of the various cipher letters 
should exactly coincide %rlth, or even closely approximate the relative 
order of frequency of letters of normal plain, text found in a count of 
50,000 letters. It Is to he emphasized that the beginner roust repress 
the natural tendency to place too much confidence in the generalized prin ¬ 
ciples of frequency and to rely too much upon them . It is far better to 
bring into effective use certain other data concerning normal plain text, 
such as digraphic and trigraphic frequencies, 

kjZj, Preparation of the work 3heet .—a. The details to be- considered 
in this paragraph may at first appear to be superfluous, but long expe¬ 
rience has proved that systematization of the work and preparation of the 
data in the most utilizable, condensed form is most advisable, even if 
this seems to take considerable time. In the first place, if it merely 
serves to avoid interruptions and irritations occasioned by failure to 
have the data in an instantly available form, it will pay by saving men¬ 
tal wear and tear. In the second place, especially in the case of com¬ 
plicated cryptograms, painstaking care in these details, while it may not 
alwayB bring about success, is often the factor that is of greatest 
assistance in ultimate solution. The detailed preparation of the data 
may be irksome to the student, and he may be tempted to avoid as much of 
it as poss'ible, but, unfortunately, in the early stages of solving a 
cryptogram he does not know (nor, for that matter, does the expert always 
know) just which data are essential and which may be neglected. Even 
though not all of the data may turn out to have been necessary, as a gen¬ 
eral rule, time is saved in the end if all the usual data are prepared as 
a regular preliminary to the solution of most cryptograms, 

b. First, the cryptogram is recopied in the form of a work sheet . 
This sheet should be of a good quality of paper so as to withstand con¬ 
siderable erasure. If the cryptogram is to be copied by hand, cross- 
section paper of jj-inch squares is extremely useful. The writing should 
be in ink, and plain, carefully-made roman capital letters should be used 
in all cases .3 If the cryptogram is to be copied on a typewriter, the 
ribbon employed should be impregnated with an ink that will not smear or 
smudge under the hand. 


q 

It is advisable to use, for this purpose, the system of standardized 
manual printing adopted by Service communications personnel. The use of 
this system, which is included in Appendix J f assures that work sheets 
are completely legible, not only to the person preparing them, but to 
others as well. 
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£. The arrangement of the characters of the cryptogram on the work 
sheet is a matter of considerable importance. If the cryptogram as first 
obtained is in groups of regular length (usually five characters to a 
group) and if the uniliteral frequency distribution shows the cryptogram 
to be monoalphabetic, the characters should be copied without regard to 
this grouping. It Is advisable to allow one space between letters (this 
is especially true for work sheets prepared on the typewriter), and to 
write a constant number of letters per line, approximately 25. At least 
two spaces, preferably three spaces, should be left between horizontal 
lines, to allow room for multiple assumptions. Care should be taken to 
avoid crowding the letters in any case, for this is not only confusing to 
the eye but also mentally irritating when later It is found that not 
enough space has been left for making various sort 3 of marks or indica¬ 
tions . If the cryptogram is originally In what appears to be word lengths 
(and this is the case, as a rule, only with the cryptograms of amateurs), 
naturally It should be copied on the work sheet in the original group¬ 
ings « If further study of a cryptogram shows that some special grouping 
is required, it is often best to recopy it on a fresh work sheet rather 
than to attempt to indicate the new grouping on the old work sheet. 

d. In order to be able to locate or refer to specific letters or 
groups of letters with speed, certainty, and without possibility of con¬ 
fusion, it is advisable to use coordinates applied to the lines and 
columns of the text as It appears on the work sheet. To minimize possi¬ 
bility of confusion, it Is best to apply letters to the horizontal lines 
of the text, numbers to the vertical columns. In referring to a letter, 
the horizontal line in which the letter is located is usually given first. 
Thus, referring to the work sheet shown below, coordinates A17 designate 
the letter Y, the 17th letter in the first line. The letter I is usually 
omitted from the series of line indicators so as to avoid confusion with 
the figure 1. If lines are limited to 25 letters each, then each set of 
100 letters of the text is automatically blocked off by remenibering that 

4 lines constitute 100 letters. 

e. Above each character of the cipher text may be some indication 
of the frequency of that character in the whole cryptogram. This indi¬ 
cation may be the actual number of times the character occurs, or, if 
colored pencils are used, the cipher letters may be divided up into three 
categories or groups—high-frequency,- medium-frequency, and low-frequency. 
It "Is perhaps simpler, if clerical help is available, to indicate the 
actual frequencies. This saves constant reference to the frequency 
tables, which interrupts the train of thought, and saves considerable 
time In the end, since it enables the student better to visualize fre ¬ 
quency-pat terns of words. In any case, it is recommended that the fre¬ 
quencies of the letters comprising the repetitions be inscribed over their 

* In some cryptosystem, certain low-frequency letters are employed as 
word separators to indicate the end of a word; if the meaning of these 
letters is discovered, it is tantamount to having the cryptogram in word 
lengths and thus the work sheet is made accordingly. See also in this 
connection the treatment on word separators in Section VII. 
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respective lettersj likewise, the frequencies of the first 10 and last 10 
letters should also he inscribed, as these positions often lend themselves 
readily to attack.' 1 

f. After the special frequency distribution, explained in Par. ^3 
below, has been constructed, repetitions of digraphs and trigraphs should 
be underscored. In so doing, the student should be particularly watchful 
for trigraphic repetitions which can be further extended into tetragraphs 
and polygraphs of greater length. Repetitions of more than ten charac¬ 
ters should be set off by heavy vertical lines, as they indicate repeated 
phrases and are"of considerable assistance in solution. If a repetition 
continues from one line to the next, put an arrow at the end of the under¬ 
score to signal this fact. Reversible digraphs and trigraphs should also 
be indicated by an underscore with an arrow pointing in both directions • 
Anything which strikes the eye as being peculiar, unusual, or significant 
as regards the distribution or recurrence of the characters should be 
noted. All these marks should, if convenient, be made with ink so as not 
to cause smudging. The work sheet will now appear as shewn below (not 
all the repetitions are underscored): 
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See Appendix h in this connection. 


REF ID:A56892 


RESTRICTED 

f t3» Trillteral frequency dis br ibut ions , —a, Jn what has gone before, 
a type of frequency distribution known as“a uniliteral frequency distri¬ 
bution was used. This, of course, shows only the number of times each 
individual letter occurs• In order to apply the normal digraphic and 
trigraphic frequency data (given in Appendix 2) to the solution of a 
cryptogram of the type now being studied, it is obvious that the data 
with respect to digraphs and trigraphs occurring in the cryptogram should 
be compiled and should be compared with the data for normal plain text. 

In order to accomplish this in suitable manner, it is advisable to con¬ 
struct a more comprehensive form of distribution termed a triliteral 
frequency distribution .® 

b. Given a cryptogram of 50 or more letters and the task of deter¬ 
mining what trigraphs are present in the cryptogram, there are three ways 
in which the data may be arranged or assembled. One may require that the 
data show (l) each letter with its two succeeding letters} (2) each let¬ 
ter with its two preceding letters; (3) each letter with one preceding 
letter and one succeeding letter. 

£. A di stribution of the first of the three foregoing types may be 
designated as a "triliteral frequency distribution showing two suffixes"; 
the second type may be designated as a "triliteral frequency distribution 
showing two prefixes"; the third type may be designated as a "triliteral 
frequency distribution showing one prefix and one suffix." Quadrilateral 
and pentaliteral frequency distributions may occasionally be found useful. 

d. Which of these three arrangements is to he employed at a specific 
time depends largely upon what the data are intended to show. For present 
purposes, in connection with the solution of a monoalphabetic substitution 
cipher employing a mixed alphabet, possibly the third arrangement, that 
showing one prefix and one suffix, is most satisfactory, 

e. It is convenient to use -jj-inch cross-section paper for the con¬ 
struction of a triliteral frequency distribution in the form of a distri¬ 
bution showing crests and troughs, such as that in Figure l4. In that 
figure the prefix to each letter to be recorded is inserted in the left 
half of the cell directly above the cipher letter being recorded; the 
suffix to each letter is inserted in the right half of the cell directly 
above the letter being recorded; and in each case the prefix and the suf¬ 
fix to the letter being recorded occupy the same cell, the prefix being 
directly to the left of the suffix. The number in parentheses gives the 
total frequency for each letter. 


It is felt advisable here to distinguish between two closely related 
terms. A triliteral distribution of A B C D E F would consider the 
groups A B C, B C D, C D E, D E F; a trigraphic distribution would con¬ 
sider only the trigrnphs ABC and D E F, (See also subpar, 23d.) 
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f. The triliteral frequency distribution is now to be examined with 
a view to ascertaining what digraphs and trigraphs occur two or more 
times in the cryptogram* Consider the pair of columns containing the 
prefixes and suffixes to D c in the distribution, as shown in Fig* 14. 

This pair of columns shows that the following digraphs appear in the 
cryptogram: 


Digreiphs based on prefixes 
(arranged as one reads up 
the column) 

FD, ZD, 2D, VD, AD, YD, BD, 
ZD, ID, ZD, YD, BD, ZD, ZD, 
ZD, CD, ZD, YD, VD, SD, GD, 
ZD, ID 


Digraphs based on suffixes 
(arranged as one reads up 
the column) 

DZ, DY, DS, DF, DZ, DZ, DV, 
DF, DZ, DF, DZ, DV, DF, DZ, 
DT, DZ, DO, DZ, DG, DZ, DI, 
DF, DE 


The nature of the triliteral frequency distribution is such that in find¬ 
ing What digraphs are present in the cryptogram it is immaterial whether 
the prefixes or the suffixes to the cipher letters are studied, so long 
as one is consistent in the study . For example, in the foregoing list of 
digraphs based on the prefixes to D c , the digraphs FD, ZD, ZD, VD, etc., 
are found; if now, the student will refer to the suffixes of F c , Zc, V c , 
etc., he will find the very same digraphs Indicated. This being the case, 
the question may be raised as to what value there is In listing both the 
prefixes and the suffixes to the cipher letters. The answer is that by 
so doing the trigraphs are indicated at the same time. For example, in 
the case of D c , the foilwing trigraphs are indicated: 


FDZ, ZDY, ZDS, VDF, ADZ, YDZ, BDV, ZDF, IDZ, ZDF, YDZ, BDV, ZDF, 

ZDZ, ZDT, CDZ, ZDO, YDZ, VDG, SDZ, GDI, ZDF, IDE. 

g. The repeated digraphs and trigraphs can now be found quite read¬ 
ily. Thus, in the case of D c , examining the list of digraphs based on 
suffixes, the following repetitions are noted: 


DZ appears 9 times; DF appears 5 times; DV appears 2 times 


Examining the trigraphs with D c as central letter, the following repeti¬ 
tions are noted: 


ZDF appears 4 times; YDZ appears 3 times; BDV appears 2 times 

h. It is unnecessary, of course, to go through the detailed proce¬ 
dure set forth in the preceding subparagraphs in order to find all the 
repeated digraphs and trigraphs. The repeated trigraphs with D c as cen¬ 
tral letter can be found merely from an inspection of the prefixes and 
suffixes opposite D c in the distribution. It is necessary only to find 
those cases in which two or more prefixes are identical at the same time 
that the suffixes are identical. For example, the distribution shows at 
once that in four cases the prefix to D c is Z c at the same time that the 
suffix to this letter is F c . Hence, the trigraph ZDF appears four times. 
The repeated trigraphs may all be found in this manner. 
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f. The trilateral frequency distribution is now to "be examined with 
a view to ascertaining what digraphs and trigraphs occur two or more 
times in the cryptogram. Consider the pair of columns containing the 
prefixes and suffixes to D c in the distribution, as shown in Fig. l4. 

This pair of columns shows that the following digraphs appear in the 
cryptogram: 


Digraphs based on prefixes 
(arranged as one reads up 
the column) 


Digraphs based on suffixes 
(arranged as one reads up 
the column) 


FD, ZD, 2D, VD, AD, 
ZD, ID, ZD, YD, BD, 
ZD, CD, ZD, YD, VD, 
ZD, ID 


YD, 

BD, 

DZ 

ZD, 

ZD, 

DF 

SD, 

GD, 

DT 

■ 


DF 


DY, D3, DF, DZ, DZ, DV, 

DZ, DF, DZ, DV, DF, DZ, 
DZ, DO, DZ, DG, DZ, DI, 
DE 


The nature of the triliteral frequency distribution is such that in find¬ 
ing what digraphs are present in the cryptogram it is immaterial whether 
the prefixes or the suffixes to the cipher letters are studied, so long 
as one is consistent in the study . For example, in the foregoing list of 
digraphs based on the prefixes to D c , the digraphs FD, ZD, ZD, VD, etc., 
are found; if now, the student will refer to the suffixes of F c , v c , 
etc., he will find the very same digraphs indicated. This being the case, 
the question may be raised as to what value there is in listing both the 
prefixes and the suffixes to the cipher letters. The answer is that by 
so doing the trigraphs are indicated at the same time. For example, in 
the case of D c , the following trigraphs are indicated: 


FDZ, ZDY, ZDS, VDF, ADZ, YDZ, BDV, ZDF, IDZ, ZDF, YDZ, BDV, ZDF, 

ZDZ, ZDT, CDZ, ZDO, YDZ, VDG, SDZ, GDI, ZDF, IDE. 

jg. The repeated digraphs and trigraphs can now be found quite read¬ 
ily. Thus, In the case of D c , examining the list of digraphs based on 
suffixes, the following repetitions are noted: 

DZ appears 9 times; DF appears 5 times; DV appears 2 times 

Examining the trigraphs with D c as central letter, the following repeti¬ 
tions are noted: 


ZDF appears 4 times; YDZ appears 3 times; BDV appears 2 times 

h. It Is unnecessary, of course, to go through the detailed proce¬ 
dure set forth In the preceding subparagraphs in order to find all the 
repeated digraphs and trigraphs. The repeated trigraphs with D c as cen¬ 
tral letter can he found merely from an inspection of the prefixes and 
suffixes opposite D c in the distribution. It is necessary only to find 
those cases in which two or more prefixes are identical at the same time 
that the suffixes are identical. For example, the distribution shows at 
once that in four cases the prefix to D c is Z c at the same time that the 
suffix to this letter is F c . Hence, the trigraph ZDF appears four times. 
The repeated trigraphs may all be found in this manner. 
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i. The most frequently repeated digraphs and trigraphs are then 
assembled in wliab is termed a condensed table of repetitions , so as bo 
bring this information prominently before the eye. As a rule, in mes¬ 
sages of average length, digraphs which occur less than four or five 
times, and trigraphs whic’h'occur less than three or four times may be 
omitted from the condensed table as being relatively of no importance in 
the study of repetitions. In the condensed table the frequencies of the 
individual letters forming the most important digraphs, trigraphs, etc., 
should be indicated. 

44. Classifying the cipher letters into vowels and consonants .— 

a. Before proceeding to a detailed analysis of the repeated digraphs and 
trigraphs, a very important step can be taken which will be of assistance 
not only in the analysis of the repetitions but also in the final solu¬ 
tion of the cryptogram. This step concerns the classification of the 
high-frequency cipher letters into two groups—(l) those which most prob¬ 
ably represent vowels, and (2) those which most probably represent con¬ 
sonants. For if the cryptanalyst can quickly ascertain the equivalents 
of the four vowels. A, E, I, and 0, and of only the four consonants, N, 

R, S, and T, he will then have the values of approximately two-thirds of 
all the cipher letters that occur in the cryptogram; the values of the 
remaining letters can almost be filled in automatically. 

b, The basis for the classification will be found to rest upon a 
comparatively simple phenomenon: the associations! or combinatory be¬ 
havior of vowels is, in general, quite different from that of consonants. 
If an examination be made of Table J-Z in Appendix 2, showing the rela¬ 
tive order of frequency of the l8 digraphs composing 25 percent of Eng¬ 
lish telegraphic text, it will be seen that the letter E enters into the 
composition of 9 of the l8 digraphs; that is, in exactly half of all the 
cases' the letter E is one of the two letters forming the digraph. The 
digraphs containingare as follows: 

ED EN ER ES 

HE RE SE TE VE 

The remaining nine digraphs are as follows: 

/TT HD OR ST 

in m th 

ON TO 

c. None of the 18 digraphs is a combination of vowels . Note now 
that of the 9 combinations with E, 7 are with the consonants N, R, S, and 
T, one is with D, one is with V, and none is with any vowel . In other 
words, Ep combines most readily with consonants but not with other vowels, 
or even with itself. Using the terras often employed in the chemical ana- 
logy, E shows a great "affinity" for the consonants N, R, S, T, but not 
for the vowels. Therefore, if the letters of highest frequency occurring 
in a given cryptogram are listed, together with the number of times each 
of them combines with the assumed cipher equivalent of Ep, those which 
show considerable combining power or affinity for the cipher equivalent 
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of Ep may "be assumed to he the cipher equivalents of N, &, S, Tp; those 
which do not show any affinity for the cipher equivalent of Ep may he as¬ 
sumed to he the cipher equivalents of A, I, 0, Up. Applying these prin¬ 
ciples to the problem in hand, and examining the triliteral frequency 
distribution, it is quite certain that Zc=Ep, not only because Z c is the 
letter of highest frequency, but also because it combines with several 
other high-frequency letters, such as D c , F c , G c , etc. The nine letters 
of next highest frequency are: 

23 22 19 19 16 15 lh- 10 10 

DTFGVHYS I 


Let the combinations these letters form with Z c be indicated in the fol¬ 
lowing manner: 

Number of times Z c ^ 
occurs as prefix— g s 

Cipher Letter-D(23) T(22) F(19) G(l9) V(l6) H(l5) Y(l4) S(l0) l(l0) 

Number of times Z c 0 0 ^ 0 

occurs as suffix—- s 


d, Consider D c . It occurs 23 times in the message and 18 of those 
times it is combined with Z c , 9 times in the form Z C D C (=E9p), and 9 times 
in the form DcZc (s0Ep). It is clear that D c must be a consonant. In 
the same way, consider T c , which shows 9 combinations with Z c , k in the 
form ZcTc (=E9p) and 5 in the form T C Z C (=©Ep). The letter T c appears to 
represent a consonant, as do also the letters F c , G c , and Y c . On the 
other hand, consider V c , occurring in all 1 6 times but never in combina¬ 
tion with Z c j it appears to represent a vowel, as do also the letters He, 
Sc, and Ic. So far, then, the following classification would seem logical: 


Vowels 

z c(= E p)i V c , H c , S c , I c 


Consonants 
D c , T c , F c , G c , Y c 


45. Further analysis of the letters representing vowels and conso ¬ 
nants .—a. Op is usually the vowel of second highest frequency. Is it 
possible to determine which of the letters V, H, S, I c is the cipher 
equivalent of Op? Let reference be made again to Table 6 in Appendix 2, 
where it is seen that the 10 most frequently occurring diphthongs are: 

Diphthong-10 OU EA El AI IE AU EO AY UE 

Frequency-4l 37 35 27 17 13 13 12 12 11 

If V, H, S, I c are really the cipher equivalents of A, I, 0, Up (not 
respectively), perhaps it is possible to determine which is which by 
examining the combinations they make among themselves and with Z c T=Ep)• 
Let the combinations of V, S, S, I, and Z that occur in the message-be 
listed. There are only the following: 

ZZ C —k VHq—2 KH C —1 EI C —1 IS C —1 SV C —1 

ZZ C is of course EE^. Note the doublet HH C ; if H c is a vowel, then the 
chances are excellent that H^O-d because the doublets AA^, Up, UUp, are 
practically non-existent, whereas the double vowel combination 00p is of 
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next highest .frequency to the double vowel combination EEp. If H c =Op, 
then Vc must be Ip because the digraph VHc occurring two times in the 
message could hardly be A0 p , or U0 p , whereas the dipthong I0p is the one 
of high frequency in English, So far then, the tentative (because so far 
unverified) results of the analysis are as follows: 

Z^sEp H c =Op V c =I p 

This leaves only two letters, I c and S c (already classified as vowels) to 
be separated into Ap and Up, Note the digraphs: 

Hic-oep ■ is c =ee p sv c =©ip 

Only two alternatives are open: 

(1) Either I c =A p and S c =U p , 

( 2 ) Or I c =U p and S^sAp, 

If the first alternative is selected, then 

BT c sOAp IS c sAU p SV c =UI p 

If the second alternative is selected, then 

HI c =OU p IS c =UAp SV c =AIp 

The eye finds it difficult to choose between these alternatives; but sup¬ 
pose the frequency values of the plaintext diphthongs as given in Table 6 
of Appendix 2 are added for each of these alternatives, giving the follow¬ 
ing: 

HI 0 sOAp, frequency valuer 7 
SV c rUIp, frequency values 5 
ISc=ATJj), frequency value=13 


Total. 25 

Mathematically, the second alternative appears to be more probable than 
the first J Let It be assumed to be correct and the following (still 
tentative) values are now at hand: 

Zc = E p H c =Op V C =I P S c =A p 1 q-U p 

b. Attention is now directed to the letters classified as conso¬ 
nants: How far is It possible to ascertain their values? The letter D c , 
from considerations of frequency alone, would seem to be T p , but its 
frequency, 23, is not considerably greater than that, for T c . It is not 

7 A more accurate guide for choosing between the alternative groups 
of digraphs could be obtained through a consideration of the logarithmic 
weights of their assigned probabilities, rather than their plaintext 
frequency values. These weights are given in Appendix 2, along with an 
explanation of the method for their derivation; a detailed treatment of 
their application is presented in Military Cryptanalysis, Part II, 


HI c :OUp, frequency value=37 
SV c =AIp, frequency value=17 
IS c =UA p , frequency values 5 

Total- 59 
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much greater than blmb for F c or G c , yith. a fro policy of 19 each, Bxxt 
perhaps it is possible to ascerbain not the value of one letter alone hut 
of two letters at one stroke. To do this one may make use of a tebxa- 
graph of considerable importance in English, viz ., TIOUp. For if the 
analysis pertaining to the vowels is correct, and if VEc-IOp, then an 
examination of the letters immediately before and after the digraph VH C 
in the ciphei’ text might disclose both Tp and Up. Refei'ence to the text 
gives the following: 

GVHT C TW C 

' ©ioe p ei09 p 

The letter T c follows VH C in both cases and very probably indicates that 
Tc = Npj but as to whether G c or F c equals Tp cannot be decided. However, 
two conclusions are clear: first, the letter D c is neither Tp nor lip, 
from which it follows that it must be either Rp or Sp; second, bhe let¬ 
ters G c and F c must be either Tp and Sp, respectively, or Sp and Tp, 
respectively, because the only tetragraphs usually found (in English) 
containing the diphthong I0p as central letters are SIONp and TIOUp. 

This in turn means that as regards D c , tTxe labter cannot be either Rp or 
Sp; it must be Rp, a conclusion which is corroborabed by the fact that 
ZDc (sERp) and DZc (=REp) occur 9 times each. Thus far, then, the iden¬ 
tifications, when inserted in an enciphering alphabet, are 'as follows: 

Plain- ABCDEFGHXJKLMNOPQRSTUVWXYZ 

Cipher-S Z V TH DGFI 

F G 

46. Substituting deduced values in the cryptogram .—a. Thus far the 
analysis has been almost purely hypothetical, for as yet not a single one 
of the values deduced from the foregoing analysis has been tried out in 
the cryptogram. It is high time that this be done, because the final 
test of the validity of the hypotheses, assumptions, and identifications 
made in any cryptographic study is, after all, only this: do these hy¬ 
potheses, assumptions, and identifications ulbimately yield verifiable, 
intelligible plain text when consistently applied to the cipher text? 

b. Ab the present stage in the process, since there are at band the 
assumed values of bub 9 out of the 25 letters that appear*, it is obvfous 
that a continuous "reading" of the cryptogram can certainly not be expect¬ 
ed from a mere insertion of the values of the 9 letters. However, the 
substitution of these values should do two things. First, it should 
immediately disclose the fragments, outlines, or "skeletons" of "good" 
words in the text; and second, it should disclose no places in the text 
where "impossible" sequences of letters are established. By the first 
is meant that the partially deciphered text should show the outlines or 
skeletons of words such as may be expected to be found in the communi¬ 
cation; this will become quite clear in the next subparagraph. By the 
second is meant that sequences, such as "AOOEIT" or "TERSEITO" or the like, 
obviously not possible or extremely unusual in normal English text, must 
not result from the substitution of the tentative identifications result¬ 
ing from the analysis. The appearance of several such extremely unusual 
or impossible sequences would at once signify that one or more of the as¬ 
sumed values is incorrect. 
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c. Hero are the results of substituting the nine values which have 
"been deduced by the reasoning based on a classification of the high- 
frequency letters into vovels and consonants and the study of the members 
of the two groups: 

1 2 3 4 6 « 7 8 9 10 U 12 13 14 1$ U 17 18 19 20 21 22 23 24 28 


SB 19 19 85 23 14 10 


35 23 10 


SFCZFIOGHLPZFGZDYSFFHBZDS 


A I RE T 


E T S E R 
S T 


ERA 


19 U 15 22 19 5 8 5 10 23 19 19 14 10 

B GVHTFUPLVDFGYV 
SXO'NT XRTS I 

T S ST 


10 19 16 18 22 19 8 23 35 36 

VFVHTGADZZ 
ITIONS REE 
S T 


8 10 22 14 23 35 14 19 35 8 36 22 19 5 22 10 22 35 4 23 10 10 16 22 35 

O AITYDZYFZJZTGPTVTZBDVFHTZ 
N RE TE ENS NINE RITONE 

ST S 

23 19 8 10 4 19 10 23 35 14 10 22 8 3 10 14 10 22 3 19 16 2 19 35 35 

D D F X S B G I'D ZYVTXOIYVTEFVMGZZ 
RT A S RE IN IN TI SEE 

ST ST 

22 15 5 5 10 8 35 23 19 2 16 22 35 8 10 22 14 23 35 14 4 23 10 19 15 

E thlLvxzdfmhtzaitydzybdvfh 

NO I ERT ONE N RE RITO 
S S 

22 35 23 19 2 35 23 35 35 8 10 8 10 10 10 36 14 19 8 10 19 10 6 19 35 

F TZDFKZDZZJSXISGZYGAVFSLGZ 

NERT EREE A ASE S ITA SE 
S T T S T 

23 22 15 15 22 1 23 35 2 10 10 22 14 35 23 3 35 19 10 16 22 35 8 10 22 

G DTHHTCDZRSVTYZDOZFFHTZAIT 

RNOON RE AIN ER ETTONE N 

S S 

14 23 35 14 19 8 10 23 19 35 35 22 2 15 10 22 14 35 14 10 23 35 10 15 5 

H YDZYGAVDGZZTKHITYZYSDZGHU 

RE S IRSEEN 0 N E ARESO 
T T T 

85 19 35 22 10 5 6 10 23 10 8 1 24 15 8 8 10 2 6 86 23 10 6 10 23 

J ZFZTGUPGDIXWGHXASRUZDFUID 

ETENS SR SO A ERT R 

S T T T S 


8 10 16 22 18 3 8 10 8 8 

K EGHTVEAGXX 
S O N I 1 S 

T T 
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d, Ho impossible sequences are hrought to light, and, moreover, 
several long words, nearly conrplene, stand out in the text. Note the 
following portions: A21 

HBZDSGVHTP 

(1) 07ERASI0NT 

T S 

CIS 

TVTZBDVFHTZDF 

(2) NINE7RIT0NERT 

S S 

F22 

SLGZDTHHT 

(3) A7SERN00N 

T 


The words are obviously OPERATIONS, NINE PRISONERS, and AFTERNOON. The 
value G c is clearly Tpj that of F c is Spj and the following additional 
values are certain: 

Bc=Pp L c =Fp 


bf, Completing the solution ,—a. Each time an additional value is 
obtained, substitution is at once made throughout the cryptogram. This 
leads to the determination of further values. In an ever-widening circle, 
until all the identifications are firmly and finally established, and 
the message Is completely solved. In this case the decipherment is as 
follows: 

X 2 3 4 6 0 7 8 0 10 11 12 13 14 15 10 17 18 10 20 21 22 23 24 25 


A 

B 

O 

D 


E 


F 

G 

IT 

J 


SFDZFIOGHLPZF 

ASRESULTOFYES 

GVHTFUPLVDFGY 

TIONSBYFIRSTD 

AITYDZYFZJZTG 

HUNDREDSEVENT 

DFXSBGIDZYVTX 

RSCAPTKREDINC 

THLLVXZDFMHTZ 

NOFFICERSXONE 

TZDFKZDZZJSXI 

NERSWEREEVACU 

DTHHTCDZRSVTY 

RN00NQREMAIND 

YDZYGAVDGZZTK 

DREDTHIRTEENW 

ZFZTGUPGDIXWG 

ESENTBYTRUCKT 


GZDYSPFHBZDS 

TERDAYSOPERA 

VJVFVHTGADZZ 

IVISIONTHREE 

PTVTZBDVFHTZ 

YNINEPRISONE 

OIYVTEFVMGZZ 

LUDINGSIXTEE 

AITYDZYBDVFH 

HUNDREDPRISO 

SGZYGAVFSLGZ 

ATEDTHISAFTE 

ZDOZFFHTZAIT 

ERLESSONEHUN 

HITYZYSDZGHU 

0UNDEDARET0B 

HXASRUZDFUID 

0CHAMBERS3UR 


K 


EGHTVEAGXX 

GT0N1GHTXX 
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Message: AS RESULT OP YESTERDAYS OPERATIONS BY FIRST DIVISION THREE 
HUNDRED SEVENTY NINE PRISONERS CAPTURED INCLUDING SYSTEM OFFICERS ONE 
HUNDRED PRISONERS HERE EVACUATED THIS AFTERNOON REMAINDER LESS ONE HUN¬ 
DRED THIRTEEN WOUNDED ARE TO BE SENT BY TRUCK TO CH/&ZBERSBURG TONIGHT 

b, The solution should, as a rule, not he considered complete until 
an attempt has "been made to discover all the elements underlying the gen¬ 
eral system and the specific key to a message. In this case, there is no 
need to delve further into the general system, for it is merely one of 
unillteral substitution with a mixed cipher alphabet. It is necessary 
or advisable, however, to reconstruct the cipher alphabet because this 
may give clues that later may become valuable. 

£, Cipher alphabets should, as a rule, be reconstructed by the 
cryptanalyst in the Torn of enciphering alphabets because they will then 
usually be in the form in which the encipherer used them. This is impor¬ 
tant for two reasons. First, if the sequence in the cipher component 
gives evidence of system in its construction or if it yields clues point¬ 
ing toward its derivation from a key word or a key phrase, this may often 
corroborate the identifications already made and may lead directly to 
additional identifications. A word or two of explanation is advisable 
here. For example, refer to the skeletonized enciphering alphabet given 
at the end of subpar, h 5b: 


Plain- ABCDEFGHIJKLMN OPQRSTUVWXYZ 

Cipher-S Z V TH DGFI 

F G 


Suppose the cryptanalyst, looking at the sequence DGFI or DFGI in the 
cipher component, suspects the presence qf a keyword-mixed alphabet. 

Then DFGI is certainly a more plausible sequence than DGFI, Examining 
the skeleton cipher component more carefully, he notes that S . . , Z 
would allow for insertion of three of the missing letters UWXY, since the 
letters T and V occur later, probably in the keyword itself j further, he 
notes that the key word probably begins under Fp and ends in TH, making 
it probable that the TH is followed by AB or BC. This would mean that 
either P, Qp=A, B e or B, C c . Assuming that P, Qp=A, B c , he refers to the 
frequency distribution and finds that the assumptions PpsA c and Qp=B c are 
not goodj on the other hand, assuming thab P, Qp=B, Cc, the frequency 
distribution gives excellent corroboration. A trial of these values 
would materially hasten solution because it is often the case in crypt¬ 
analysis that if the value of a very low-frequency letter can be surely 
established it will yield clues to other values very quickly. Thus, 
if Qp is definitely identified it almost invariably will identify Up, and 
will give clues to the letter following the Up, since it must be a vowel. 
In the case under discussion the identification P, QprB, C c would have 
turned out to be correct. For the foregoing reason an attempt should 
always be made in the early stageB of the analysis to determine, if 
possible, the basis of construction or derivation of the cipher alphabet^ 
as a rule this con he done only by means of the enciphering alphabet, and 
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not the deciphering alphabet. For example, the skeletonized deciphering 
alphabet corresponding to the enciphering alphabet directly above is as 
follows: 


Cipher- ABCDEFGHIJKLMBOPQRSTUVWXYZ 

Plain- R T S 0 U AN 1 S 

S T 


Here no evidences of a keyword-mixed alphabet are seen at all* However, 
if the enciphering alphabet has been examined and shows no evidences of 
systematic construction, the deciphering alphabet should then be examined 
with this in view, because occasionally it is the deciphering alphabet 
which shows the presence of a key or keying element, or which has been 
systematically derived from a word or phrase. The second reason why it is 
important to try to discover the basis of construction or derivation of 
the cipher alphabet is that it affords clues to the general type of key 
words or keying elements employed by the enemy. This is a psychological 
factor, of course, and may be of assistance in subsequent studies of his 
traffic. It merely gives a clue to the general type of thinking indulged 
in by certain of his cryptographers. 

d. In the case of the foregoing solution, the complete enciphering 
alphabet is found to be as follows: 


Plain- ABCDBFGHIJXLMHOPQR'STUVWXYZ 

Cipher- SUXYZLBAVHWORTHBCDFOIJKMP 


Obviously, the letter Q, which is the only letter not appearing in the 
cryptogram, should follow P in the cipher component. Note now that the 
latter is based upon the keyword IEAVENWORTH, and that this particular 
cipher alphabet has been composed by shifting the mixed sequence based 
upon this keyword five intervals to the right so that the key for the 
message is Ap-S c Note also that the deciphering alphabet fails to give 
any evidence of keyword construction based upon the word LRAVEIWORTH. 


Cipher- ABCDEFGHIJKLMHOPQRSTUVWXYZ 

Plain- HPQRGSTOUVWFXJLYZMAHBIKCDE 


e. If neither the enciphering nor the deciphering alphabet exhibits 
characteristics which give indication of derivation from a key vord by 
some form of mixing or disarrangement, the use of such a key word for 
this purpose is nevertheless not finally excluded as a possibility. For 
the reconstruction of such mixed alphabets the cryptanalyst must use 
ingenuity and a knowledge of the more common methods of suppressing the 
appearance of key words in the mixed alphabets. Several of these methods 
are given detailed treatment in par. 51 below. 

f. It is very important in practical cryptanalytic work to pre¬ 
pare a technical summary of the solution of a system. Step-by-step 

- n" -• - - 

° It is usual practice to employ as the specific key the equivalent 
of either A-n, or the equivalent of the first letter of the plain compo- 
nenb when this component is a mixed sequence. 


9b 
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commentaries should accompany an initial solution; the steps taken should 
he jotted down as they are made , and at the end they should he coirfbined 
into a complete r£sum? "of the analysis. The r£sum£ should he brief and 
concise, yet comprehensive enough that at any future time the solution 
may he reconstructed following the exact manner in which it was origi¬ 
nally accomplished. Assumptions of words, etc,, should he referred to 
with work sheet line- and column-indicators, and should he couched in the 
proper cryptologic language or symbols, A short exposition of the mech¬ 
anics of the general system, enciphering alphabets, enciphering diagrams, 
etc., as well as all key words (together with their derivation) and spe¬ 
cific keys should he included. On the work sheet there should he a 
letter-for-letter decryptment under the cipher text; the final plaintext 
version 'should be in word lengths, with any errors or garbles corrected, 
Nulls or indicators showing sentence separation, change of key, etc,, may 
he enclosed in parentheses. All work sheets and notes should he kept 
together with the solution. 

48, General remarks on the foregoing solution ,—a. The example 
solved above is admittedly a more or less artificial illustration of the 
steps in analysis, made so in order to demonstrate general principles. 

It was easy to solve because the frequencies of the various cipher let¬ 
ters corresponded quite well with the normal or expected frequencies. 
However, all cryptograms of the same monoalphabetical nature can he 
solved along the same general lines, after a certain amount of experi¬ 
mentation, depending upon-the length of the cryptogram, and the skill 
and experience of the cryptanalyst.9 

h. It is no cause for discouragement if the^student’s initial 
attempts to solve a cryptogram of this type require much more time and 
effort than were apparently required in solving the foregoing purely 
illustrative example. It is indeed rarely the case that every assumption 
made by the cryptanalyst proves in the end to have been correct; more 
often it is the case that a good many of his initial assumptions are in¬ 
correct, and that he loses much time in casting out the erroneous ones. 
The speed and facility with which this elimination process is conducted 
is in many cases all that distinguishes the expert from the novice. 


o 

The use of monoalphahetic substitution in modern military operations 
is exceedingly rare because of the simplicity of solution. However, such 
cases have occurred, and one rather illuminating instance may be cited# 

In an important communication on 5 August 1918, General Kress von 
Kressenstein used a single mixed alphabet, and the intercepted radio mes¬ 
sage was solved at American GHQ very speedily, A day later another mes¬ 
sage, but in a very much more difficult cipher system, was intercepted 
and solved. When translated, it read as follows 1 

"GHQ Kress: 

The cipher prepared by General von Kress was at once solved here# 

Its further use and employment is forbidden. 

Chief Signal Officer, Berlin,” 
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o_. Nor 'will the student always find that the initial classification 
into vowels and consonants can he accomplished as easily and quickly as 
was apparently the case in the illustrative example. The principles in¬ 
dicated are very general in their nature and applicability, and there are, 
in addition, some other principles that may he brought to hear in case of 
difficulty. Of these, perhaps the most useful are the following: 

(1) In normal English it is unusual to find more than two consonants 
in succession, each of high frequency. If in a cryptogram a succession 
of three or four letters of high-frequency appear in succession, it is 
practically'certain that at’least one of these represents a vowel, 1< “ ) 

(2) Successions of three vowels are rather unusual in English . 11 
Practically the only time this happens is when a word ends in two vowels 
and the next word begins with a vowel , 12 

(3) When two letters already classified as vowel-equivalents are 
separated by a sequence of six or more letters, it is either the case 
that one of the supposed vowel-equivalents is incorrect, or else that 
one or more of the intermediate letters is a vowel-equivalent.^ 

(4) Reference to Table 7-B of Appendix 2 discloses the following: 

Distribution of first 18 digraphs forming 2 g percent of English text 

Number of consonant-consonant digraphs--— b 

Number of consonant-vowel digraphs--r------ - 6 

Number of vowel-consonant digraphs--— -- —-—- 8 

Number of vowel-vowel digraphs----— 0 

Distribution of first 53 digraphs forming go percent of English text 

Number of consonant-consonant digraphs-‘-- 8 

Number of consonant-vowel digraphs--- 23 

Number of vowel-consonant digraphs--—---- l 8 

Number of vowel-vowel digraphs'--- b 


Sequences of seven consonants are not impossible, however, as in 
STR ENGTH THRO UGH. 

11 Note that the word RA DIOED , past tense of the verb RADIO, is coming 
into usage. 

12 A sequence of seven vowels is not impossible, however, as in THE 
WAY YOU EARN. 

13 

Some cryptanalysts place a good deal of emphasis upon this principle as a method of locating the remaining 
vowels after the first two or three have been located. They recommend that the latter be underlined throughout 
the text and then all sequences of five or more letters showing no underlines be studied attentively. Certain 
letters which occur in several such sequences are sure to be vowels. An arithmetical aid in the study is as follows; 
Take a letter thought to be a good possibility as the cipher equivalent of a vowel (hereafter termed a possible 
vowel-equivalent) and find the length of each interval from the possible vowel-equivalent to the next known (fairly 
surely determined) vowel-equivalent. Multiply the interval by the number of times this interval is found. Add 
the products and divide by the total number of intervals considered. This will give the mean interval for that 
possible vo^\ el-equivalent. Do the same for all the other possible vowel-equivalents. The one for which the 
mean is the greatest is most probably a vowel-equivalent. Underline this letter throughout the text and repeat 
the process for locating additional vowel-equivalents, if any remain to be located. 
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The latter tabulation shows that of the first 53 digraphs which form 50 
percent of English text, 4l of them, that is, over 75 percent, are com¬ 
binations of a rowel with a consonant. In short, in normal English the 
vpwels and the high-frequency consonants are in the long run distributed 
fairly evenly and regularly throughout the text, 

(5) As a rule, repetitions of trigraphs in the cipher text are com¬ 

posed of high-frequency letters forming high-frequency combinations. The 
latter practically always contain at least one vowel; in fact, if refer¬ 
ence 'is made to Table 10-A of Appendix 2 it will be noted that of the 

56 trigraphs having a frequency of 100 or more contain one vowel, 17 of 
them contain two vowels , and only three of them contain no vowel. In the 
case of tetragraph repetitions, Table 11-A of Appendix 2 shows that no 
tetragraph listed therein fails to contain at least one vowel; 27 of them 
contain one vowel, 25 contain two vowels, and 2 contain three vowels. 

(6) Quite frequently when two known vowel-equivalents are separated 
^hy six or more letters none of which seems to be of sufficiently high 

frequency to represent one of the vowels A E I 0, the chances are good 

that the cipher-equivalent of the vowel TJ or Y is present. 

.T-- 

d. To recapitulate the general principles, vowels may then be dis¬ 
tinguished from consonants in that they are usually represented by; 

(1) high-frequency letters; ' - 

(2) high-frequency letters which do not readily contact each other; 

(3) high-frequency letters which have a great variety of contact; 

(4) high-frequency letters which have an affinity for low-frequency 
letters (i.e., lcw-frecuency plaintext consonants). 

e. In the foregoing example the amount of experimentation or "cut¬ 
ting and fitting" was practically nil, (This is not true of real cases 
as a rule.) Where such experimentation is necessary, the underscoring of 
all repetitions of seve:ral letters is very essential, as it calls attent¬ 
ion to peculiarities of structure that often yield clues, 

f. After a few basic assumptions of values have been made, if short 
words or skeletons of words do not become manifest, it is necessary to 
make further assumptions for unidentified letters. This is accomplished 
most often by assuming a word.^-4 Now there are two places in every mes¬ 
sage which lend themselves more readily to successful attack by the assump¬ 
tion of words than do any other places—the very beginning and the very 
end of the message. The reason is quite obvious, for although words may 
begin or end with almost any letter of the alphabet, they usually begin 

-j. 

^_ 'This process does not involve anything more mysterious than ordinary, logical reasoning; there Is nothing 
of the subnormal or supernormal about it. If cryptanalytic success seems to require processes akin to those of 
medieval magic, if “hocus-pocus ' 1 is much to the fore, the student should begin to look for items that the claimant 
of such success has carefully hidden from view, for the mystification of the uninitiated. Tf the student were to 
adopt, as his personal motto for all his cryptanalytic ventures the quotation (from Tennyson's poem Columbus) 
appearing on the back of the title page of this text, he will frequently find “short cuts r ' to his destination and wilt 
not too often be led 


\ 



91 


HE O TIUCTE fr 


REF ID:A56892 


and end with hut a few v§fy comifton digraphs and trigraphs. Very often 
the association of letters in peculiar combinations will enable the stud¬ 
ent to note where one word ends and the next begins. For example suppose, 
E, N, S, and T have been definitely identified, and a sequence like the 
following Is found in a cryptogram: 

. . .EUTSHE, . . 

Obviously the break between two words should fall either after the S of 
E N T S or after the T of E IT T, so that two possibilities are offered: 

, . • ENTS / HE . . ., or . . .ENT/SNE. . .. Since in 
English there are very few words with the initial trigraph S N E, it is 
most likely that the proper division is . . • BHTS/HB . , M Of 
course, when several word divisions have been found, the solution is 
more readily achieved because of the greater ease with which assumptions 
of additional new values may be made. 

jg. Although a considerable amount of detailed treatment has been 
devoted to vowel-consonant analysis, it is felt advisable again to caution 
the student against the natural tendency to accept without question the 
results of any one eryptanalytic technique exclusively, even one such as 
vowel-consonant analysis which seems quite scientific in character. 

49, The "probable-word" method? its value and applicability .—a. In 
practically all eryptanalytic studies, short cuts can often be made”by 
assuming the presence of certain words in the message under study. Some 
writers attach so much value to this kind of an "attack from the rear" 
that they practically elevate it to the position of a method and call it 
the "intuitive method" or the "probable-word method." It is, of course, 
merely a refinement of what in everyday language is called "assuminG" or 
"guessing" a word in the message. The value of making a "good guess" can 
hardly be overestimated, and the cryptanalyst should never feel that he 
is accomplishing a solution by an illegitimate subterfuge when he has 
made a fortunate guess leading to solution. A correct assumption as to 
plain text will often save hours or days of labor, and sometimes there 
is no alternative but to try to "guess a word", for occasionally a system 
is encountered the solution of which is absolutely dependent upon this 
artifice. 

h. The expression "good guess" is used advisedly. For it is "good" 
in two respects. First, the cryptanalyst must use care in making his 
assumptions as to plaintext words. In this he must be guided by extra¬ 
neous circumstances leading to the assumption of probable words—not just 
any words that come to his mind. Therefore he must use’his imagination 
but he must nevertheless carefully control it by the exorcise of good 
judgement. Second, only if the "guess" is correct and leads to solution, 
or at least puts him on the road to solution, it is a good Guess, But, 
while realizing the usefulness and the time and labor-saving features of 
a solution by assuming a probable word, the cryptanalyst should exercise 
discretion in regard to how long he may continue in his efforts with this 
method. Sometimes he may actually waste time by adhering to the method 
too long, if straightforward, methodical analysis i7ill yield results more 
quickly. ) 
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e. Obviously, the ’'probable-word 1 ' method has much more applicability 
when working upon liaberJal the general nature of which is known, than when 
working upon more or lops isolated comraunioabionn exchanged between corres¬ 
pondents concerning whom or whose activities nothing is known. For in the 
latter case there ic little or nothing that the imagination can seize upon 
as a background or bosis for the assumptions *15 However, in the case of 
military cryptannlysis Ln time of active operations there is, indeed, so 
great a probability that cei-tain words and expressions ore present in cer¬ 
tain cryptograms that those words and expressions ("cliches") arc often 
referred to as "cribs" (as defined in Webster’s New Collegiate 'Diction¬ 
ary: ",*.a plagiarism; hence, a translation, etc,, to aid a student in 
reciting."). The cryptanalyst is quite sure they are present in the 
cryptogram under examination—'what he must do is to "fit' the crib to 
the text", that is, locate it in the cipher text, 

-&« Very frequently, the choice of probable words is aided or limit¬ 
ed by"the number and positions of repeated letters• These repetitions 
may be patent —that is, externally visible in the cryptographic text as 
it originally stands—or they may be latent —that is, externally invis¬ 
ible but susceptible of being made patent as a result of the analysis. 

For example, in a monoalphabetic substitution cipher, such as that dis¬ 
cussed in the preceding paragraph, the repeated letters are directly 
exhibited in the cryptogram; later the student will encounter many cases 
in which the repetitions are latent, but are made patent by the analyt¬ 
ical process. When the repetitions are patent, then the pattern or for¬ 
mula to which the repeated letters conform is of direct use in assuming 
plaintext words; and when the text is in word-lengths, the pattern is 
obviously of even greater assistance. Suppose the cryptanalyst is deal¬ 
ing with military text,. In which case he may expect such words as DIVIS¬ 
ION, BATTALION, etc,, to be present in the text. The positions of the 
repeated letter I in DIVISION, of the reversible digraph AT, TA in BAT¬ 
TALION, and so on, constitute for the experienced cryptanalyst tell-tale 
indications of the presence of these words, even when the text is not 
divided up into its original word lengths. 

e. The important aid that a study of word patterns can afford in 
cryptanalysis warrants the use of definite terminology and the establish¬ 
ment of certain data having a bearing thereon. The phenomenon herein 
under discussion, namely, that many words are of such construction as 
regards the number and positions of repeated letters as to make them 
readily identifiable, will be termed idiomorphism (from the Greek "idios"» 
one’s own, individual, peculiar + "morphe"sform). Words which show this 
phenomenon will be termed idi omorphlc « It will be useful to deal with 
the ldiomorphlsms symbolically and systematically as described below. 


. — 

♦General GIvierge in his Court de Ciyplographie (p 121) says. “However, expert cryptanalysts often 
employ Buch details as are citeiabove {in connection with assuming the presence of ‘probable words’], and the 
experience of the years 1914 to 1918, to cite only those, prove that in practice one often has at his disposal ele¬ 
ments of this nature, permitti lg assumptions much more audacious than those which served for the analysis 
of the last example. The reader would therefore be wrong in imagining that sueli fortuitous clomenls are 
encountered only in cryptographic works where the author deciphers a document that ho himself enciphered. 
Cryptographs correspondence, if it is extensive, and if sufficiently numerous working data are at hand, often 
furniskcB elements so complete that an author would not dare use all of them in solving a problem for fear of 
being accused of obvious exaggeration.” 
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f • When dealing with cryptograms in which the word lengths are 
determined or specifically shown, it is convenient to indicate their 
lengths and their repeated letters in some easily recognized manner or 
hy formulas. This is exemplified, in the case of the word DIVISION, by 
the formula ABCBDBEF; in the oase of the word BATTALION, by’the formula 
ABCCBDEFG* If the cryptanalyst, during the course of his studies, makes 
note of striking formulas he has encountered, with the words which fit 
them, after some time he will have assembled a quite valuable body of 
data. And after more or less complete lists of such formulas have been ■ 
established in some systematic arrangement, a rapid comparison of the 
idiomorphs in a specific cryptogram with those in his lists will be feas¬ 
ible and will often lead to the assumption of the current word. Such 
lists can be arranged according to word length, as shown herewith: 


3/aba 

DID, EVE, EYE, etc. 

abb 

ADD, ALL, ILL, OFF, etc. 

4/abac 

ARAB, AWAY, etc. 

abbc 

ALLY, BEEN, etc. 

abca 

AREA, BOMB, DEAD, etc. 

abcb 

ANON, CEDE, etc. 

etc. 

etc. 


jg. When dealing with cryptographic text in which the lengths of the 
words are not indicated or otherwise determinable, lists of the foregoing 
nature are not so useful as lists in which the words (or parts of words) 
are arranged according to the intervals between identical letters, in the 
following manner: 


1 Interval 
-DiD- 
-EvE- 
-EyE- 
dlvlsion 
revision 
etc. 


g Intervals 
AbbAey 
ArAbiA 
AblAtive 
AboArd 
-AciA- 
etc. 


3 Intervals 
AbeyAnce 
hAbitAble 
lAborAtory 
AbreAst 
AbroAd 
etc, 


Repeated digraphs 
COCOa 
-<3ERER 
ICICle 
-INIRg 
bAGgAGe 
etc. 


h. The most usual practice, however, in designating idiomorphic 
patterns and classifying them into systematic lists is to assign a lit¬ 
eral nomenclature to that portion of a word (or sequence of plaintext 
letters) "which contains the distinctive pattern, beginning with the first 
letter which is repeated in the pattern and ending with the last letter 
which is repeated in the pattern. Thus, the word DIVISION would he term¬ 
ed as an idiomorph of the abaca class (based on the sequence TVTST con¬ 
tained therein), and t he wo rd BATTALION as an idiomorph of the abba class 
(based on the sequence ATTA), In Appendix 3 will be found a compendium 
of the more frequent military words in English, arranged according to 
word-lengths in alphabetical order and in rhyming order; In addition, 
there will be found in this appendix a listing of idiomorphs arranged 
first according to pattern and then according to the first letter of the 
idiomorphic sequence. 
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50. Solution of additional cryptograms produced by the same compon ¬ 
ents .-■■•a* To return, after a rattier long digression, to the cryptogram 
solved in pars. 44 - 47, once the components of a cipher alphabet have 
"been reconstructed, subsequent messages which have been enciphered by 
means of the same components may be solved very readily, and without re¬ 
course to the principles of frequency, or application of the probable- 
word method. It has been seen that the illustrative cryptogram treated 
in paragraphs 4l - 47 was enciphered by juxtaposing the cipher component 
against the normal sequence so that ApsS c , It is obvious that the cipher 
component may be set against the plain component at any one of 26 differ¬ 
ent points of coincidence, each yielding a different cipher alphabet. 
After the components have been reconstructed, however, they become 
known sequences and the method of converting the cipher letters into 
their plain-component equivalents and then completing the plain-component 
sequence 3 -® begun by each equivalent can be applied to solve any crypto¬ 
gram which has been enciphered by these components. 

b. An example will serve to make the proeess clear* Suppose the 
following message, passing between the same two stations as before, was 
intercepted shortly after the first message had been solved: 

IYEWK CERNW OFOSE LFOOH EAZXX 

It is assumed that the same components were used, but with a different 
key letter. First the initial two groups are converted into their 
plain-component equivalents by setting the cipher component against the 
plain component at *any arbitrary point of coincidence. The initial let¬ 
ter of the former may as well be set against A of the latter, with the 
following result: 


Plain— 

Qipher- 


-ABCDEFGHIJKLMNOPQRSTUVWXYZ 

-LEAVNWORTHBCDFGIJKMPQSUXYa 

Cryptogram-—- IYEWK CERNW ...' 
Equivalents— PYBFR iBHEF ... 


The plain component sequence Initiated by each of these conversion equiv¬ 
alents is now completed, with the results shown in Fig. 15. Note the 
plaintext generatrix, CLOSEYOURS, which manifests itself without further 
analysis« The rest of the message may he read either by continuing the 
same process, or, what is even more simple, the key letter of the message 
may now be determined quite readily and the message deciphered by its 
means. 


It must be noted that if the plain component is a mixed sequence,, 
then it is thiB mixed sequenee which must be used to complete the 
columns, 
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IYEWKCERNW 
PYBFRLBHEF 
QZCGSMCIFG 
RADHTNDJOH 
SBEIUOEKHI 
TCFJVPFLIJ 
UDGKWQGMJK 
VEHLXRHNKL 
WFIMYS IOLM 
XGJNZTJPMN 
YHKOAUKQNO 
ZILPBVLROP 
AJMQCWMSPQ 
BKNRDXNTQR 
*C LOSEYOUHS ‘ 

DMPTFZPVST 
ENQUGAQWTU 
FORVHBRXUV 
OPBWICSYVW 
HQTXJDTZWX 
IRUYKEUAXY 
JSVZLFVBYZ 
KTWAMGWCZA 
LUXBNHXDAB 
MVYCOIYEBC 
NWZDPJZFCD 
OXAE QKAGDE 

Figure 15. 

c. In order that the student may understand without question just 
what is involved in the latter step, that is, discovering the key letter 
after the first two or three groups have "been deciphered "by the conver¬ 
sion-completion process, the foregoing example will he used. It was 
noted that the first cipher group was finally deciphered as follows: 

Cipher-. IIEiK 

Plain- CLOSE 

Now set the cipher component against the normal sequence so that Cp=I c , 


Thus: 

Plain- ABCDEFGHIJKLMNOPQRSTUVWXYZ 

Cipher- FGIJICMPQSUXYZLEAVNWORTHBCD 


It is seen here that when Cpsl c then Ap=F c . This is the key for the en¬ 
tire message. The decipherment may he completed hy direct reference to 
the cipher alphabet. Thus: 

Cipher— IYEWK CERNW OFOSE LFOOH EAZXX 
Plain— CLOSE YOURS TATIO NATTW OPM XX 

Message: CLOSE YOUR STATION AT TOO PM 
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d. The student should make sure that he understands the fundamental 
principles involved in this quick solution, for they are among the most 
important principles in cryptanalytics, How useful they are will "become 
clear as he progresses into more and more complex cryptanalytic studies. 

e. It must "be kept in mind that there are four ways v that two "basic 
sequences may he used to form a cipher alphabet, subject‘to the instruc¬ 
tions guiding the cryptographer in the use of his cryptosystem; this fact 
must "be considered when additional cryptograms appear in a particular 
cryptosystem for which the primary components have "been recovered. Assum¬ 
ing that the sequences just recovered are labelled "A" and "B", then the 
following contingencies might arise in the encryption of subsequent mes¬ 
sages! 

(1) "A" direct for the plain component, and M B" direct for the 
cipher component (as in the original recovery); 

( 2 ) "A 1 * direct for the plain, and "B" reversed for the cipher; 

(3) ”B" direct for the plain, and "A” direct for the cipher; and 

(4) "B" direct for the plain, and “A" reversed for the cipher. 

51. Derivation of key words .—a. Concurrent with the solution of a 

cryptogram, there should "be a simultaneous effort in the reconstruction 
of cipher alphabets and recovery of key words• Much labor can thus be 
saved as recovery of the keys early in the stages of solution may trans¬ 
form the process of cryptanalysis into one of decipherment, 

h. A mixed cipher alphabet falls into one of five categories, 
according to the composition of its components, viz ., 

(1) the plain component is the normal sequence and the cipher com¬ 
ponent is mixed; 

(2) the cipher component is the normal sequence and the plain com¬ 
ponent is mixed; 

(3) both components are the same mixed sequence; 

(k) "both components are the same mixed sequence, but running in 
reverse; or 

(5) the components are different mixed sequences. 

c. Let us examine several types of mixed sequences, using the key 
word HYDRAULIC as an example. The ordinary keyword-mixed sequence pro¬ 
duced from this key word is: 

(l) HYDRAULICBEFGJKMHQPQSTVWXZ 
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The two principal transposition-mixed types based on this key word are 
derived from the diagram: 


HYDRAULIC 
BEFGJKMNG ’ 

PQSTVWXZ and read: 

(2) HBPYEQDFSRGTAJVUKWLMXINZCOand 

(3) AJVCODFSHBPINZLMXRGTUKWYEQ 

Other types may arise from various types of route transpositions such as 
the following, using the foregoing diagram: 


(4) 

H 

B 

P 

Q 

E 

Y 

D 

F 

S 

T 

G 

R 

A 

J 

V 

■W 

K 

U 

L 

M 

X 

Z 

N 

I 

C 

0 

(5) 

H 

Y 

B 

P 

E 

D' 

R 

F 

Q 

S 

G 

A 

U 

J 

T 

V 

K 

L 

I 

M 

w 

X 

N 

C 

0 

z 

(6) 

P 

B 

Q 

H 

E 

S 

Y 

F 

T 

D 

G 

V 

R 

J 

W 

A 

K 

X 

U 

M 

z 

L 

N 

I 

0 

0 

(7) 

H 

Y 

D 

R 

A 

U 

L 

I 

C 

0 

N 

M 

K 

J 

G 

F 

E 

S 

P 

0 

s 

T 

V 

W 

X 

z 

(8) 

0 

C 

I 

L 

U 

A 

R 

D 

Y 

H 

B 

P 

Q 

S 

T 

V 

W< 

X 

z 

N 

M 

K 

J 

G 

F 

E 

(9) 

H 

Y 

E 

B 

P 

Q 

S 

T 

G 

F 

D 

R 

A 

U 

K 

J 

V 

W 

X 

Z 

N 

M 

L 

I 

C 

0 

(10) 

C 

P 

I 

0 

Q 

B 

L 

N 

S 

E 

H 

U 

M 

z 

T 

F 

Y 

A 

K 

X 

V 

G 

D 

R 

J 

w 


Any transposition system may he employed to produce a systematically- 
mixed sequence; practicability of method is the only determining factor. 

It must be remembered that the greatest amount of systematic mixing will 
produce a sequence inherently no more secure than a random-mixed alphabet, 

d. The student would do well to construct both enciphering and de¬ 
ciphering versions of cipher alphabets recovered, as has been previously 
mentioned. For example, in the following case 

Plain: JQNMFHLEBR5KGYZ0T ICDUVAWPX 
Cipher: ABCDEFGHIJKLMNOPQRSTUVWXYZ 

no semblance of a key is apparent; but in the inverse form 

Plain: ABCDEFGHIJKLMHOPQRSTUVWXYZ 
Cipher: WISTHEMFRALGDCPYBJKQUVXZNO 

the key-phrase "NOW IS THE TIME FOR ALL GOOD MEN TO COME TO THE AID OF 
THEIR PARTY" 1 b quite clear. In other types of mixed sequences, first 
the one form is attacked, and then if negative results are obtained the 
inverse form is treated, 

e. Let us consider the following cipher alphabet: 

P: ABCDEFGHIJKLMNOPQRSTUVWXYZ 
C: DWZMSOCRYATXBEFUGQHIVJKLNP 

The section V W X seems to comprise superimposed parts of the non-keyword 
J K L 

portions of mixed sequences. Adding Y Z to the plain component, we get 
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V W X Y Z which is certainly consistent as far as alphabetical progres- 
JKLNP 

sion goes, and indicates that the letters M and 0 are present in the key 
word of the cipher component. Continuing in this vein, the section 
MNOQSTVWXYZis rapidly established by correlating both se- 
BEFGHIJKLITP 

quences, It is obvious that the plain component key word begins right 
after the Z, and that the cipher component key word probably just pre¬ 
cedes the B, Going to the right, Z R H suggests key words like RHOMBOID, 

P Q R • 

RHEUMATISM, etc. These trials are quickly repudiated; therefore we go on 
to Z R E which is acceptable, Z R E K is found wanting, but Z R E P is 
PQS PQST PQSU 

very satisfactory, and this is soon expanded toZREPUBLIC, and in 

FQSUVWXYZ 

a moment or two we recover the complete cipher alphabet: 

Pi REPUBLIC! ANDFGHJKMOQSTVWXYZ 
C: QSUVWXYZDEMOCRATBFGHIJKLNP 

f, In the example below the student will observe that the alphabets 
are reciprocal: this is an indication of identical sequences at a shift 
of 13 , or that a mixed sequence running against itself in reverse has 
been employed. In this case the W X Y Z points to the latter hypothesis. 

Z Y X W 

P: ABCDE'FGHIJKLMNOPQRSTUVWXYZ 
Cj HOJFTDNAKCIMLGBSUVPEQRZYXW 

Starting with the V W X Y Z R cluster, we see that the key word begins 
R Z Y X W V 

with the letter R; therefore the next letter should be a vowel. Z R A 

¥ V H 

is not acceptable, but Z R E is fine, showing that the letter U appears 

W V T 

in the key word. Continuing the same line of reasoning as in the preced¬ 
ing example, and with a little further experimentation, the final alpha¬ 
bet is discovered to be 

P: REPUBLICANDFGHJKMOQSTVWXYZ 
C: VTSQOMKJHGFDHAC ILBUPERZYXW 

g. In the next example, all efforts to derive key words on the 
basis of keyword-mixed sequences are fruitless: the conclusion is there¬ 
fore drawn that this is a case of a transposition. 

P: ABCDEFGHIJKLMNOPQRSTUVWXYZ 
- C: ACSEJYIGWLFVMHXNKZPBQRDUTO 

Considering the mechanics of the cryptography involved, and assuming for 
the time being that Z is at the bottom of the matrix and not in the key 
word, we start with the; letters to the left, or if this fails, to the 
right of Z in the cipher component, obtaining the column N which is not 
" K 

z' 

incompatible if IT is in the key word on the top row. If we place Y to 
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the left of Z and build up Its column, we get E H which is excellent* 

JK 
Y Z 

This Is expanded into X M E N which quickly becomes 718435269 

GHJK ' PAITTm E N T 

W X Y Z BODFGSJKO 

QSUVWXYZ 

This last example was very easy because none of the letters V W X Y Z ap** 
peared in the key word; but other cases should hardly prove more difficult* 

h. Two additional methods that have been encountered for deriving 
mixed - "sequences maybe mentioned. One is a slight modification of the 
preceding paragraph, when the key word contains repeated letters J 

18734 9'526 

COM.IT.E. 

ABDFGHJKL 

HPQRSUVWX 

Y Z which produces the mixed sequences 

CAHYEKWFRIGSJVLXMDQOBPZTHU 

The other method is an interrupted-key columnar transposition systems 17 

5 1 3 4 2 6 

V A L . E Y 
B C) 

D F G H I) 

J K M) 

uopq) 

R) 

S T U W X Z) which produces the mixed sequences 

ACFKOTE IXLGMPUHQWVBDJHRSYZ 

The first example will succumb to the treatment outlined in subpar, g, 
whereas the second method is vulnerable owing to the presence of the 
fragments D J H, F K 0, and G M P in the sequence which may be anagram- 
med. Rote the fair-sized fragment B D J R R S, composed of an ascending 
sequence of letters; this is an outward manifestation of the interrupted- 
key columnar method. 

i, There are still other methods used for the production of mixed 
sequences, but space does not permit giving further examples. However, 
the student should by this time he able to devise methods of attack for 
any special cases that may present themselves, based upon the crypt- 
analytically exploitable weaknesses or peculiarities inherent in the 
system of cryptography involved. 


17 

It is to he noted that in this particular case the numerical key 
serves two purposes: (l) determining the cut-off point (and therefore 
the number of letters) in each row of the diagram, after the appearance 
of the keyword; and ( 2 ) determining the order of transcription of the 
columns. 


ft EOTRICTED 


106 


REF ID:A56892 


REOTIUCTEg 


SECTION VII 

, ,MULT3XITERAL SUBSTITUTION WITH SINGLE-EQUIVALENT CIPHER ALPHABETS 

.Paragraph 


General types of multiliteral cipher alphabets. . .. 52 

The Baconian and Trlthemian ciphers.. ... 53 

Analysis of raultiliteral, monoalphabet ic'substitution ciphers....... 5 ^ 

Historically Interesting examples..... 55 

The international (Baudot) teleprinter code.... §6 


• -• -52. General types of mult Hit eral cipher alphabets .--a. 
betic substitution methods in general may be classified into 
and multiliteral systems, In the former there is a strict "one-to-ope* 
correspondence between the length of'the units of the plain and those of 
the cipher text; that is, each letter of the plain text is replaced by a 
single character in the cipher text. In the latter this correspondence 
is no longer l p :lc but may’be lp:2 C , where each letter of the plain-text 
is replaced by a combination of two characters in the cipher text; or 
Ip:3 C > where a three-character combination in the cipher text represents 
a single letter of the plain text, and so on. A.cipher in which the cor¬ 
respondence is of the l p :l c type is termed uniliteral -in character; one 
in which it Is of the lp:2 c type, biliteral ; lp:3 c , triliteral , and so on. 
Ciphers in which one plaintext letter is represented by cipher characters 
of two or more elements are classed as multiliteral. 1 - ■ 

b. Biliteral alphabets are usually composed of a set of 25 or 26 
pombinations of a limited number of characters taken in pairs. An 
pxample of such an alphabet is the following: 


Plain-— 

- A 

B 

C 

* D 

E 

F 

G 

H 

I 

J 

K 

£ 

M 

Cipher— 

- VJW 


WI 

WT 

WE 

HW 

HH 

HI 

BT 

HT 

HE 

IW 

IH 

Plain- 

3’ 

0 

"p“ 

'Q 

R" 

§ 

T 

‘ U 

V ‘ 

W 

X 

Y 

Z 

Cipher— 

- II 

IT* 

IE 

TW 

TH 

TI 

4/ 

TT 

TE 

EW 

EH 

El 

ET 

EE 


_ - t. 

This alphabet is derived from the cipher square or matrix shown in 
Fig. l6. The cipher equivalent of each plaintext element is made up of 
two coordinate letters from outside the cipher matrix, one letter being 
the coordinate of the row, the other being the coordinate of the column 


1 -The terms unillteral and multiliteral, although originally applied 
only to cipher text composed of letters, are used here in their broader 
sense to embrace cipher text in letters, digits, and even other symbols, 
In more precise terminology, these terms would probably be monosymbolic 
and polysymbolic , respectively, but the terms uniliteral and multiliteral 
are too well established in literature to be changed at this late t ime - 
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In which the plaintext letter is located. In other words, the letters at 
the side and top of the matrix have "been used to designate, according to 

(S) 


H 

(1) I 
T 
E 


- • * Figure 1 6 . ■■ •' - v 

a coox*dinate system, the cell occupied by each letter within the matrix. 
The letters (or figures) constituting the coordinate elements of such 
matrices are termed row and column indicators . ~ 

<3. If a message is enciphered by means of the _foregoing biljLter&l 
alphabet, the cryptogram is still monoalphabetic in character. A fre.-. 
quency distribution based upon pairs of letters will obviously have all 
the characteristics of a simple, unillteral distribution for .a monoalpha¬ 
betic substitution.cipher. 

d. The cipher alphabets shown thus far in this text have involved 

only letters',' but alphabets in which the cipher component consists of 
figures, or groups of figures, are not uncommon in military cryptography. 2 
Since there are but 10 digits it is obvious that, in order to represent 
an alphabet of more than 10 characters by means of figure ciphers, combi¬ 
nations of at least two digits are necessary. The simplest kind of such 
an alphabet is that in which 4p=01, Bp=02, . . . Zp= 26 j that is, one in 
which the plaintext letters have_as their equivalents two-digit numbers 
Indicating their positions_in the normal alphabet. . . - 

e. Instead of a simple alphabet of the preceding type, it is pos¬ 
sible""to use a diagram of the type shown in Fig. IT. In this cipher ‘ . 



1 

2 

_2_ 

4 


6 

_z_ 

8 

JL 

± 

* 1 1 

A 

B 

c. 

D 

E 

F 

a 

H 

I 

3 

2 | 

K 

L 

M. 

N 

0 

P 

Q 

R 

s 

T 

3 

U 

V 

W 

X 

Y 

Z 

* 

t 

• 

m 

m 

t 


Figure 17. 


2 Although, as an extension of this idea, cipher alphabets employing 
signs and symbols are possible, such alphabets are not suitable for 
modern cryptography because they can be neither telegraphed nor tele¬ 
phoned with any degree of accuracy, speed, or facility. 


w HITE 


A 

B 

C 

b 

E 

F 

G 

H 

i-j 

K 

L 

M 

H 

0 

P 

Q 

R 

S 

T 

U 

V 

W 

X 

Y 

Z 
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tine letter Ap is represented "by the dinome^ 11, B "by the dinome 12, 
etc'. Furthermore, this matrix includes provision ror the encipherment of 
some of the froquently^used punctuation marks in addition to the 26 let¬ 
ters. - ■■ ; 

f . Other types of biliteral cipher alphabets are Illustrated in the 
examples below: rz~^L i. s * >- » ■ 



5 

6 7 

8 

9 

> 


1 

2 

3 4 


6 

JL 

8 

9 

1 

A 

B C 

D 

E 

F 

. 1 

A 

B 

C D 

E 

F 

G 

H 

I 

2 

G 

H I-J K 

L 

M 

2 

J 

K 

L M 

N 

0 

P 

Q 

R 

3 

N 

0 P 

Q 

R 

S 

3 

S 

T 

U V 

W 

X 

Y 

z 

* 

b 

T U-V W 

X 

Y 

Z 












.Figure l8. 


Z- -- 


- 

.Figure 19. 

+ 


- - 



- -- 

“ * 

— 








" 



- 


- ~ - 

- 

- 

— 

. . 






■ 1 

. *. 


- 

M 

N 

I 

C 

‘ H 

' - ~ 

A 

B 

C D 

E 

F 

V 

H 

I 

B 

G 

7 E 

5 

R 

"m 

A 

A 

D 

G J 

M 

P 

s 

V 

Y 

E 

A 

1 N 

T 

B 

2 

B 

B 

E 

II K 

N 

Q 

T 

W 

Z 

R 

C 

3 D 

b 

F 

6 

C 

C 

F 

I L 

0 

R 

U 

X 

1 

L 

H 

8 I 

9 

J 

i 

D 

2 

3 

4 5 

6 

7 

8 

9 

* 

I 

K 

L 0 

P 

Q 

s 





- 





N 

T 

U V 

w 

X 

z 




Figure 21. 





Figure 20. 


jg. It is to he noted that in alphabets of the foregoing types, the 
row indicators maybe distinct from the column indicators (e.g.. Fig. 18), 
or they may not (e.g., Fig. 19); of course, when there is any duplication 
between the row and column indicators, it is necessary to agree beforehand 
TipoiPwhich indicator will be given as' the first half of the equivalent 
"for a letter, in order to avoid ambiguity, (in all of the systems de¬ 
scribed in this and subsequent sections of this text, the row indicator 
will always form’the first'part of an equivalent). When letters are used 
as row and column indicators they may form a key word (e.g.. Fig. 20), or 
they may not (e.g.. Fig. 2l); the keywords, if formed, maybe identical 
(e.g.. Fig. l6) or different (e.g., Fig. 20;. Furthermore, the plaintext 
letters maybe arranged within the matrix as a mixed sequence (e.g.. Fig. 
20), either systematically- or random-mixed; and the matrix ms.y contain, 
in addition to the letters of the alphabet, punctuation symbols (Fig. 17), 
numbers (Figs. 20, 21), etc., permitting their encipherment as such, in¬ 
stead of having to be spelled out. 


■ A pair of digits is called a dinome ; similarly, a trinome is a set 
of three digits; a tetranome , a set of four digits; etc. Although a 
single digit would properly be termed a mononome, for the sake of euphony 
it is shortened into the term rnonome. 
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h. When letters are used as row and column indicators, they may he 
selected so as to result in producing cipher text that resembles arti¬ 
ficial words; that is, words composed of alternate vowels and consonants. 
For example, if in Figure l6 the row indicators consisted of the vowels 
A E I 0 U in this sequence from the.top down, and the column indicators 
consisted of the consonants B C D F G in this sequence’from left to right 
the word BAIDS would he enciphered as OCABE FAFOD, which very'"closely 
resembles code of the type formerly called artificial code language. 

Such a system may he called a false , or pseudo-code system, 

J., As a weak type of subterfuge, biliteral ciphers may involve a 
third’*'character appended to the ; T?asic two-character cipher unit; this is 
done to "camouflage" the biliteral nature of the cipher text, This third 
character may be produced through the use of a cipher matrix of the type 
illustrated in Fig, 22 (wherein Ap«6ll, Bp=6l2, etc.); or the third char¬ 
acter may be a "sum-checking" digit which is the non-carrylxig sum (i.e., 
the sum modulo 10)5 of the preceding two digits, such as in the trinomes 
257, 831, and 662 ; or it may merely be a randomly-selected character (in¬ 
serted solely for the purpose of leading the cryptanalyst astray), 



1 2 3^ 5 

61 

ABODE 

72 

F G HWK 

-83 

LHHOP 

94 

Q R S T TI 

05 

V W X Y Z 


Figure 22, 


TU7 


Another possibility that lends itself to certain multiliteral 
ciphers is the use of a word-spacer or word:separator . This word sepa¬ 
rator might be represented by a value in the matrix; i.e,,. the separator 
is enciphered (for instance, the dinogae "39" -in Fig. 19, might stand for 
a word separator). The word separator might instead te a, single element 
not otherwise used^in the cryptosystem; i.e,, unenciphered, and thus pot 
giving rise to any possible ambiguity. Thus, in Fig. 19 the digit $ and 
in Fig. 21 the letter J might be used as word separators, since jxo ef¬ 
fusion would av:fe in decrypting. t , ;; 


- - u*, • 


-£ ~ * ‘ - * ‘ 

Prior to 1934, international telegraph regulations, required code 

words of five letters to contain at least one vowel and code woVdq Of ten 

letters to contain at least three vowels. The International Telegraph 

Conference held in Madrid in 1932 amended these regulations to permit the 

use of 5~letter code groups containing any combination of letters.- These 

unrestricted code groups were authorized for use after JL January 1934. 

^ r1 

J The term modulo (abbreviated mod ) pertains to a cyclic scale or 
basis of arithmetic; thus, in the modulus of 7 , the numbers, 8 and 15 are 
equivalent to 1, and 9 and 16 are equivalent to 2, etc.; or expressed 
differently, 8 mod 7 is 1, 9 mod 7 is 2. In cryptology, many operations 
are expressed mod 10 and mod 26 . 


** -*lr% ' 
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k, The biliteral alphabets yielded by the matrices of Figs. 16-21 
may also "be termed bipartit e, because the cipher ■units of these alphabets 
may be divided into two separate parts whose functions are clearly de¬ 
fined, viz ., row indicators and column indicators. As will be discussed 
later, this bipartite nature of most blliteral alphabets produced from 
cipher matrices constitute one of the weaknesses of these alphabets which 
make them recognizable a,s such to a cryptanalyst. However, it is possible 
to employ a cipher matrix in a manner which will produce a blliteral 
alphabet not bipartite In character. For example, using the matrix of 
Fig. 23 one could produce the following hiliteral cipher alphabet in 


12 3 4 5 


' - 09 

H 

y 

D 

R 

A 

15 

U 

L 

I-J C‘ 

"-3 

: 21 

E 

F 

G 

K 

M 


H 

0 

F 

Q 

S 

’ * - -33 

T 

V 

w 

X 

Z 


*- -» V-r f J* < *f ^ V"* pi - *■ - .__ , 

V Figure 25." * - - - • - - - - 

Which the equivalent’ for any letter in the matrix is tfte spift'pf 
coordinates which indicate its cell in the matrix? 


Plain— 


A 

B 

C 

D 

E 

F * 

~G 


l 



& 

H 

Cipher— 


.14 

20 

19’ 

12 

22 

23 

24 

10 

ia 

18 

25 

IT 

26 

Plain- 


N 

0 

'P 

Q 

R 

S 

T 

U 

V 

w 

X 

y 

Z 

Cipher— 


28 

29 

30 

31 

13 

' t •- 

32 

t — 

34 

16 

35 

36 

37 

11 

38 


The cipher units of this alphabet are, of course, blliteral; but they are 
not bipartite. Note the equivalent of Ap, that is l4—if divided, it 
yields' the digits 1 and 4 which have no meaning per se ; plaintext letters 
whose cipher equivalents begin with 1 may be found in two different rows 
_ of the matrix, and those whose equivalents end in 4 appear in three dif- 
- ferent columns. 

" * * ~f » 

53. The Baconian and Trithemian ciphers .—a. An interesting example 
_-,.in which the cipher equivalents are five-letter groups and yet “the re- 
, suiting cipher is strictly monoalphabetic in character is found in the 
cipher system invented by Sir Francis Bacon ( 1561 - 1626 ) over 300 years 
ago. Despite its antiquity the system possesses certain features of 
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merit which are well worth noting .6 Bacon proposes the following 24- 
element cipher alphabet, composed of permutations of two elements taken 

five at a time:7 

♦ 


A=aaaaa 

Bsaaaah 

Czaaaba 

Draaabb 

Ezaabaa 

Fzaabab 

Graabba 

Hraabbb 


I-Jrabaaa 

Krabaab 

Lrababa 

Mrababb 

N-abbaa 

Orabbab 

Prabbba 

Qzabbbb 


Rrbaaaa 
Sebaaab 
T sbaaba 
U-V=baabb 
.Wrbabaa 
Xsbabab 
■ Y=babba 
Z=babbb 


If this were all there were to Bacon’s invention it would be hardly worth 
bringing to attention. But what he pointed out, with great clarity and 
simple examples, was how such an alphabet might be used to convey a 
secret message by enfolding it in an innocent, external message which 
might easily evade the strictest kind of censorship. As a very crude 
example, suppose that a message is written in capital and lower-case let¬ 
ters, any capital letter standing for an "a” element of the cipher alpha¬ 
bet, and any small letter, for a "b" element. Then the external sentence 
"All is well with me today' can be made to contain the secret message 
"Help.” Thus: 


A L 1 is 

a a b b b 

v-—--—' 

H 


WEIL 
t a a b a 

E 


W I t H 

a. .a b a 

—' v-^-v- 

i 

L 


m E 
b a, 


Today 

a b b b a 

*■--—-- ; 

P 


Instead of employing a device so obvious as capital and small letters, 
suppose that an "a" element be indicated by a very slight shading, or a 


For a true picture of this cipher, the explanation of which is often 
distorted "beyond recognition even by cryptographers, see Bacon’s own des¬ 
cription of it as contained in his De Augmentis Scientiarum (The Advance ¬ 
ment of Learning ), as translated by any first class editor, such as Gilbert 
Watts (1640) or 2 Ills, Spedding, and Heath ( 1857 , 1870 ). The student is 
cautioned, however, not to accept as true any alleged "decipherments"‘ob¬ 
tained by the application of Bacon’s cipher to literary works of the 16 th 
century. These readings are purely subjective. 

7 Bacon's alphabet was called by him a "biliterai alphabet" because it 
employs permutations of two letters. But from, the cryptanalytic stand¬ 
point the significant point is that each plaintext letter is represented 
by a 5-character equivalent. Hence, present terminology requires that 
this alphabet be referred to as a guinqueliteral alphabet . Although the 
quinqueliteral alphabet affords 32 permutations, Bacon used only 24 of 
them, because in the l 6 th century the letters I and J, U and V were used 
interchangeably. Note the regularity of construction qf Bacon's biliteral 
alphabet, a feature vhich easily permits its reconstruction from memory. 
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very slightly heavier stroke. Then a secret message might easily he thus 
enfolded within an external message of exactly opposite meaning. The 
number of possible variations of this basic scheme is very high. The 
fact that the characters of the cryptographic text are hidden in some 
manner or other has, however, no effect upon the strict monoalphahetlcity 
.of the scheme. ^ ^ __ . , . , 

b. Almost 100 yearB before Bacon’s time, the abbot Tritheiaius, 
born Johann von Heydcriberg (1462-1516), invented a triliteral alphabet 
which lie evidently intended to use in a fashion similar to Bacon’s 
alphabet; i.e,, as a means,of disguise or cover for t a secret text.^ This 
alphabet, modified to include the 2 6 letters of the present-day"English 
alphabet, is shown in Fig. 23 below; it consists of all the permutations 
of three things taken three at a time, i.e., 3^ or 27 in all. 


Arlll Dr121 'Grl31 J=211 

Brll2 Erl22 Hrl32 Kr212 

Crll3 F=123 1=133 Xi=2l3 


Mr221 
Nr 222 
Or 223 


Pr231 

Qr232- 

R=233 


Sr3ll 
T=312 
Ur 313 


- ■ h- 

Figure 23^ 


Vr321 

Nr322 

Xr3'23 




Y=331 

Zr332 

*=333 


-'■S’ 

t Bt 


The cipher text of course does not have to be restricted to digits; any 
' groupings of three things taken three at a time will do. ' ■- * 

54, Analysis of multiiiterai',"monoalphabetic substitution ciphers .— 

’ a". Biliteral ciphers and those of the other multiliteral (triliteral, 
quadriliteral, . . .) types are often readily detected externally by the 
fact that "the cryptographic text, is usually Compos'ed of but a very liml- 
- ted number of different characters. -* They are handled in exactly the same 
"manner"as 'are uniliteral, monoalphabetic substitution ciphers. So' long 
as the same character, or combination of characters, is always used to 
represent the same plaintext letter, and so long as a given letter of the 
•-plain text is always represented by the same character or combination of 
■ characters, 'the Substitution is ‘ strictly monoalphabetic and can be handled 

in the' simple .manner 'described'in the preceding section of this text. 

'= -.in - t . *. n • . 

b. In the case of biliteral ciphers in which the row and column in¬ 
dicators are not identical, and the direction of reading the cipher pairs 
is chosen at will for each Succeeding cipher pair, an analysis of the 

- contacts of the letters comprising the cipher'pairs will disclose that 
there are two distinct families of letters, and a cipher pair will never 
consist of . two letters of the same" family. With this fact discovered, 
the cipher may be quickly reduced to uniliteral terns and solved in the 
manner previously mentioned, _ .* ' " ' . 

c. If a multiliteral cipher includes provision for the encipherment 
of a word separator, the cipher equivalent of this word separator may be 

' readily identified because it will have the highest frequency of any ci¬ 
pher uriit’V' On the other hand, if the word separator is a single char¬ 
acter (see subpar. 52 on the use of the digit 0 and the letter j), this 
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character may he identified throughout the encrypted text hy its position¬ 
al appearance spaced "wordlength-vise" in the cipher text, and hy the 
fact that it never contacts itself. If this single character is used as 
a null indiscriminately throughout the cipher text, instead of as a word 
separator, the analysis is a hit more complicated hut not as great as 
might he thought. , * - - 

d. As a general rule, it is advisable to reduce multiliteral cipher 
text to uniliteral equivalents/*especially if a triliteral frequency dis¬ 
tribution is to be^made. If not mope than 36 different combinations are 
present in a cryptogram, the extra values over 2 6 maybe represented by 
digits for the purpose of this reduction. If, however, more than 36 
different combinations are found in the encrypted text, it is usually not 
worth the trouble to attempt any uniliteral reduction, and the cipher' 
text can be attacked in its multiliteral groupings. 

e. As one of the first steps in the solution of any multiliteral 

cipher in letters which appears to involve the use of a cipher matrix, 
it is generally advisable to anagram the letters comprising the row. and 
column Indicators in an attempt to disclose any key words for these in¬ 
dicators . When the anagraraming process does disclose such a key word or, 
words, the next step is to make a skeleton reconstruction matrix which is 
a duplicate of the original enciphering matrix in that the Indicators are 
arranged in the same order as on the original. Then, as plain text is 
recovered In the cryptogram hy any of the methods outlined In the. previous 
section of this text, the recovered plaintext ietters should he inserted 
In the proper cells of the reconstruction, matrix, so that any ^systematic 
arrangement of the plaintext letters, if present in the original, may be 
disclosed prior to recovery of the complete plain text.. Furthermore, it 
may in some instances he found worthwhile, immediately after successfully 
uncovering the key words used as indicators, to make a frequency distri¬ 
bution of the particular cryptogram in the form of tally marks within the 
properly arranged frame of the reconstruction matrix, because it may he 
that a few moments 1 study of the locations of the crests and troughs In 
the distribution made in that form may, if the plaintext letters are ar¬ 
ranged in the normal sequence or in a keyword-mixed., sequence (especially 
if it is related to the key words for the indicators), provide a basis 
for the derivation of this sequence at one stroke, without recourse to_ 
analysis of bh. cipher text. ' . ‘ ‘ \ " - 

55* Historically interesting exampl es.—a. Two examples of multi¬ 
literal ciphers of historical interest will he cited as illustrations. 
During the campaign for the presidential election'of 1876 (Hayes vs. 
Tilden) many cipher messages were exchanged between the Tilden managers 
and their agents in several states where the voting was hotly contested. 
Two years later the Ilew York Tribune^ exposed many irregularities in the 


Q ^ , 

New York Tribune, Extra No. 44, The Cipher Dispatches, New York, 

1879 . 
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campaign "by publishing the decipherments of many of’these messages." 
These decipherments were achieved by two investigators employed by the ^ 
Tribune, and the plain text of the'messages seems to show that" illegal * 
attempts and measures to carry the election fop Tilden were made""by his 
managers. Here is one of the messages: 


RANEY, Tallahassee. 


JACKSONVILLE, Nov. 16 (1876). 


Ppyyemnsnyyypim a~s hnsyy's, p XX epaaehshns 
"pensshnsmmpiyysnppyeaapieissyeshainsssp 
eeiyyshnynsssyepiaanyitnsshyyspyypinsyy 
ssitemeipimmeisseiyyeissiteiepyypeeiaass 
imaayespnsyyianssseissmmppnsp ins snp i ns in 
imyyitemyysspeyymmnsyyssitspyypeapppma 
aa'yypiit 

L'Engle goep up tomorrow."."■ T * • ... -* - 

' ‘ - * 1 DANIEL. 

Examination of the message discloses that only ten different letters are 
used. It is probable, therefore, that what one has here is ,a cipher 
which employs a multiliteral alphabet. First assuming that •the alphabet 
is one in which combinations of two letters represent single "letters of 
the plain text, the message is rewritten in pairs and substitution Of 
arbitrary letters for the pairs is made, -.as seen helowi 


A triliteral frequency distribution is then made and analysis of the mes¬ 
sage along the lines illustrated in the preceding section of this text 1 
yields solution, as follows: ~ T *■* ’ - -- - 

. ' _J_ - - ^ - - - " _ , ~ 

; g" Jacksonville, Nov. lo. 

- ; _ * »>,# *k i £; ... , 

GEO. F. RANEY, Tallahassee: * * ■ 

Have Marble and Coyle telegraph for influential men ""from Delaware -and 
Virginia. Indications of weakening here. Press advantage and watch 
Board. "L'Engle goes up tomorrow'. “ “ *• 

- * - , , “ i - _ 1 

: ' 4 -DANIEL. 

- ' . . . 

b. The other example, using numbers, is as follows: 

‘ : “Jacksonville, Nov. 1J . 

S. PASCO andE. M. L’ENGIE: 

84 55 84 25" 93 34 82 31 31 75 93 82 77 33 55 42 

93 20 93 66 77 66 33 84 66 31 31 93 20 82 33 66 

52 L8 44 55 42 82 48 89 42 93 31 82 66 75 31 93 

-- -- - DANIEL." " 


i . -r r * 
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There were, of course, several messages of like pature, and examination 
disclosed that only 26 different "numbers in all were used. Solution of 
these ciphers followed very easily, the decipherment of the^one given" 
above being as follows7 7* ’V* ’"* ' ’ ' .*"" ] 

Jacksonville, Nov. 17.. 

S. PASCO and E. M, L’ENGLEj 

Cocke will be ignored, Eagan called in. Authority reliable. * ' '’’ *’ 

' * • * .DANIEL. 

£♦ The Tribune experts gave the following alphabets as the result 
of their decipherments: - _ . _ ..... 


AA-0 

EN»Y 

3T=D 

NS=E 

PP-H 

SS=N‘ 

-- . 

AI S U 

EP=C 

MA=B 

NYrM 

SH=L 

YE=F 


EI«I 

IA=K 

MM=G 

PE=T 

SNzP 

YIzX 


EM=V 

B1=S 

NN=J 

PI=R 

SF=W 

YY=A 


20 =D 

33=N 

44 =H 

62 =X 

77=G 

89=Y 

‘ "* — 

25=K 

34=^ 

48 =T j 

66 rA 

' 

82=1 

93 =E 

- 

27=8 

39=P 

52 :U " 


84 rC 

96 =M 


3l=L 

42=R‘" 

'55=0 

75 =B 

» i* 

87 =V 

99=J 

11 i"* f- 


They did not attempt to correlate these alphabets, or at least they say 
nothing about-a possible relationship. _Tft e present author has, however, 
reconstructed the rectangle upon which these alphabets are based, and it 
is given below (Fig. 24)’. 
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■ ^d.Leifer or Number 

‘ - H I S 'P F' Y M 1 N T 

..12 34 5 67 8 9 0 

H 1 

12 

S 3 
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P 4 ’ 

A 5 
Y 6 
M 7 
E 8 
N 9 
T 0 
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Figure 24. 
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It ig amusing to note that the conspirator selected as their key a 
phrase quite in keeping with their attempted illegalities - HIS PAYMENT - 
for bribery seems to have played a considerable part in that campaign. 

The blank cells in the matrix probably contained proper names, numbers, 
etc.9 _ „ - - - - . 

_ w •% rvr t m* 

56. The international (Baudot) teleprinter code .—a. Modern print¬ 
ing telegraph systems,1^ or teleprinter systems as they - "are more often 
called, make use of a five -unit code- 1 - 1 or alphabet which is similar to 
the Baconian alphabet treated in par. 53* Like the Baconian alphabet, 
the teleprinter alphabet is composed of permutations of two elements 
taken five at a time, making it possible to obtain 32 different permu¬ 
tations, 26 of which are assigned to the letters of the alphabet, leaving 
1 for an "idle condition" and 5 for certain printer operations called 
functions , such as "space", "figure shift", "letter shift," etc, 

b. During electrical transmission, the two distinct elements of 
which each character is composed take the form of (l) a timed interval of 
electrical current and (2) a timed interval of no current, which are 
commonly referred to as "mark" Impulses and "space" impulses, respective¬ 
ly. In certain operations, a paper tape is prepared of the traffic to he 
transmitted, or a paper tape may be prepared of the incoming traffic at 
_ the receiving end; in such tapes, the elements of the Baudot characters 
take the form of punched holes ("mark" impulses) and imperforate positions 
("space" impulses). _ „ ^ ^ 


9 

As was mentioned in a previous footnote, a matrix containing such 
items would be termed a syllabary square; for example of such matrices 
- see the treatment of syllabary squares and code charts in Section X. 

- 3 - 0 Such systems are characterized "by the transmission and reception - 
printing of messages by electrical means, incorporating two electrically- 
connected instruments,resembling typewriters. When a key of the keyboard 
on the transmitting instrument is depressed, an electrical signal is 
transmitted to the receiving instrument, causing the corresponding char¬ 
acter to be printed therein. Usually the message is printed at the local 
as well as the distant station. The system has been adapted to radio as 
well as wire and overseas cable transmission,. 

The five-unit code was first applied to teleprinter systems by Jean 
Maurice Emile Baudot (18^4-5- 1-903 ), and is commonly known as the Baudot 
code. It is worthwhile to point out that Baudot apparently constructed 
his alphabet to correspond with normal frequencies of characters (with 
certain exceptions), since the most frequent ones are represented by per¬ 
mutations requiring the least electrical energy on the basis of "marking" 
and "spacing." In this respect Baudot "took a leaf out of Morse’s note¬ 
book." 
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£. The teleprinter code In Intel national use Is given In Chart ?, 
telovj wherein the mark and space impulses (knovm collectively as* hauls) 
are illustrated as the holes (shown as "black dots) and ‘'no-holes” of a 
teleprinter tape. The letter equivalents ("lower case") are self-explan¬ 
atory. The figure Bhift is used to change the meaning of a particular 
character to an "upper case" equivalent, and when it is desired to return 
to lower case, the letter shift is used? in regular teleprinter usage, 

** a J "* 11 1 "" 11 '' ' t ? -r + 


UPPER 

WEATHER SYMBOLS 

t 

© 

o 

/ 

3 

-* 

r—1 

1 

8 

✓ 

— 

L 

\ 


d> 

9> 

[71 

1 

4 

[71 

5 

J1 

7] 

Z 

71 

iH 

* 


R1 

s 

i 


t 

X 

CASE 

COMMUNICATIONS 

- 

y 


7 

3 

T 

a 

jC 

►a 


fi 

> 


t 

9 

0 

i 

4 

A 

s' 



z 

/ 

6 



< 

= 

SPACE 

U. 

LOWER CASE 

A 

B 

c 

0 

E 

F 

G 

H 

i 

J 

X 

L 

M 

N 

0 

P 

0 

R 

s 

T 

u 

V 

w 

X 

Y 

z 

i 

ai. 

cc 

o 

u. 

J 

vl 

a: 

ts 

vV 

C 
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O 
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0 

0 

0 




0 

O 
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q 
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0 

0 

0 

0 



“ 

0 

0 
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6 


0 




0 


0 

0 

0 

0 




© 

0 



© 

0 

0 






□ 

■ 

0 

© 



FEED HOLES 

o 

o' 

o 


*o 

0 

o : 

V 

o 

0 

0 

o' 

0 

~Q 

~o~ 

0 

0 

o H 

cT 

it 

V 

"o' 

o 

7 

o 

o' 

0 

o 

□ 

B 

*o 

0 



3 



© 



Q 


¥ 

© 


0 


O 

0 


O 

0 

■ 1 

9 


•i 

O 


o' 

©j 




1 

□ 

0 

- — 



4 


0 

0 

¥ 


0 

0 




q 


Oj 

q 
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0j 



. j 
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0 
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'O 
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5 

_ 
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0 
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© 

”1 
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0 

9 

o' 

'0 
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■ 

0 

□ 

- 


} 


^7 
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Chart J. Ijpterpationaf teleprinter code, 


‘ * ) 


* ! 
t 


the "communications" set of qp'per-case equivalents are the ones recorded 
on the typed copy "by the teleprinter/ whereas the "weather symbols" are 
the upper-case equivalents whiclj. are printed in teleprinter systems de¬ 
signed for the sending and receiving of weather information. The space 
is used to separate words; the carriage return (c.R.) effects the retppj} 
of the teleprinter carriage"to the right and the line feed (L.F.) roll? 
the platen to the next _3ine for printing (cf» the corresponding functions 
of an ordinary typewriter), In addition, when the upper-case equivalent 
of "S" is used, a "bell rings in the receiving teleprinter as a signal to 
call the operator to his machine, or to indicate that traffic is about to 
"be sent. ” — ' x 

- 3 % I» ~ - . ■*' ■* - - • 

d. In Fig. 25 is shown a portion of a teleprinter tape containing 
the "beginning of the phrase "Now is the time for all good men ..." 


<> • 

* • 


• « • 

* ••• • 


F* #••• 

•• • • 


• • 


• • •• • 
• • • 


• •• 

• ••• • « 
•• ••• • 


NOW IS THE TIME FOR ALL GOOD MEN 


Figure 25. 

The small holes, one of which appears in every position of the tape "be¬ 
tween the second and third levels, are sprocket holes used for advancing 
the tape through the transmitter unit. 
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e. It is bo be emphasized that messages are not made secure from 
unauthorized reading merely by sending them by means of an ordinary tele¬ 
printer system—the teleprinter alphabet is internationally known, just 
as the English, Russian, etc. alphabets are. In order to provide secur¬ 
ity for a teleprinter message, it is just as necessary to apply thereto 
some sort of cryptographic treatment as it is to any other kind of mes¬ 
sage. The cryptosystems used for teleprinter encryption may involve 
either, or both, of the two classes of ciyptographic treatment, viz., 
substitution and transposition. A substitution treatment might involve 

_ changing certain of the mark impulses of the characters comprising a mes¬ 
sage to space impulses, and vice versa, according to a prearranged sys¬ 
tem, a transposition treatment might involve changing the order of the 5 
impulses in the Baudot equivalents for the characters comprising a mes¬ 
sage j and so on. The cryptographic treatment can be accomplished by a 
special cipher attachment (called an "applique unit") to a teleprinter } 
thus ho modification of the teleprinter itself would be necessary. There 
are, of course, self-contained cipher teleprinters designed as such for 

engineering or cryptographic reasons, or both. 

% 

f. In the analysis of encrypted teleprinter systems, recourse is 
had to special tables 12 of the frequencies of single Baudot characters, 
digraphs, trigraphs, etc., as they appear m teleprinter traffic. It is 
important to note that in teleprinter traffic, as in any other type of 
traffic involving the use of a word separator, this character has the 
highest frequency of any plaintext elementI Furthermore, one of the 
highest-frequency plaintext digraphs, in a Idition to those wherein the 
word separator constitutes one of the elements, will be the com! ination 
"carriage-return/line-feed", since this combination of characters is used 
in the normal procedure of typing each line of text on the teleprinter. 


12 In such tables, as is common in cryptanalytic practice, the mark im¬ 
pulses are designated by a plus symbol ( + ), and the space impulses are 
designated by a minus symbol ( - ). In addition, it is usual In such 
tables to denote the character representing the figure shift by the digit 
"2", the space by "3", the letter shift by "4", the line feed by "5", the 
blank by "6", and the carriage return by "7". 
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- - * SECTION VIII 

- — Xta _ h f ~ 

MOLTILITERAL SUBSTITUTION UITH VARIANTS 

’ ^ "%. ' 

- \ - ~~ Paragraph 

Purpose of providing variant s jLji_monOjalphabetic substitution. 57 

Simple types of cipher alphabets with variants....". 7.7.. 58 

More complicated types of qipher alphabets with variants. 59 

Analysis of simple Example's.*..... 60 

Analysis of more complicated examples. 6l 

Analysis involving the use of isologs... 62 

Further remarks on variant systems.... 63 


57» Purpose of providing variants In monoalphabetic substitution .-- 

a It has been seen that the individual letters composing ordinary 
intelligible, plain text are usedwith varying frequencies, some, such as 
(m English) E, T, R, I, and N, are used much more often than others, 
such as J, K, Q, X, and Z. In fact, each letter has a characteristic 
frequency which affords definite clues m the solution of simple mono- 
alphabetic ciphers, such as those discussed in the preceding sections of 
this text. In addition, the associations which individual letters form 
in combining to make up words, and the peculiarities which certain of 
them manifest in plain text, afford further direct clues by means of 
which ordinary monoalphabetic substitution encipherments of such plain 
text may be more or less speedily solved. This has led cryptographers to 
devise methods for disguising, suppressing, or eliminating the foregoing 
characteristics manifested in cryptograms produced by the simpler methods 
of monoalphabetic substitution One - category of such methods, the one to 
be discussed in this section, is that in which the letters of the plain 
component of a cipher alphabet are assigned two or more cipher equivalents, 
which are called variant values (or, more simply, variants ). 

b. Basically, systems involving variants are multiliteraT 1 - and, in 
such systems, because of the large number of equivalents made available 


Uniliteral substitution with variants is also possible. Note the 
following cipher alphabet, illustrated by Captain Roger Baudoum m his 
excellent treatise. Elements de Cryptographie, p. 101 (Paris, 1939) 


Plain 

Cipher 


ABCDEFGHILMNOPQRSTUVXZ 


L G 0 


R F 0 
K 
V 


A H C 


M B T 
X 


I D N 


PUS 

Z 


Y E W J 


Baudouin proposed that Jp and Yp be replaced by Ip, IL by C or Q p , and 
Wp by Wp—thus four cipher letters would be available as variants for / 
tne high-frequency plaintext letters in French. f 
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by the combinations and peimutations of a limited number of elements, each 
letter of the plain text may he represented by several multiliteral cipher 
equivalents which may be selected at random. For example, if 3-letter 
combinations are employed as the multiliteral .equivalents, there are 
available 26^ or 17,57^ such equivalents for the 26 letters of the plain 
text, they may be assigned in equal numbers of different equivalents for 
the 26 letters, in which case each letter would be representable by 676 
different 3-letter equivalents, or they may be assigned on some other 
basis, for example, proportionately Ijp the lelative frequencies of plain¬ 
text letters. For this reason this type of system may be more completely 


described as a monoalphabetic, multiliteral 
equivalent ciphcx alphabet .^ Some authors < 


substitution with a multiple * 
erm such a system "simple 


substitution with multiple equivalents", others term it "monoalphabetic 
substitution with variants", or multilitera1 substitution with variants . 
For sake of brevity and precise terminology, the latter designation will 
be enrployed in this text, Jt being understood without further xestatement 
that only such systems as are monoalphabetic will be discussed. 


c. The primary object of monoalphabetic substitution with variants 
is, as has been mentioned above, to provide several values which may be 
employed at random in a simple substitution of cipher equivalents for the 
plaintext [letters 

d. A word or two concerning the underlying theory of (monoalphabetic) 
multiliteral substitution with variants may not be amiss. Whereas in 
simple or single-equivalent substitution it has been seen that 

(1) the same letter of the plain text is invariably represented by 
but one and always the same character of the cryptogram, and 

(2) the same chai acter of the cryptogram invariably represents one 
and always the same letter of the plain text, 

- - - t - 

in multiliteral substitution with variants it will be seen that 


(1) the same letter of the plain text may be represented by one 
or more different characters of the cryptogram, hut 

(2) the sane charactex of the cryptogram nevertheless invariably 
represents one and always the same letter of the plain text. 


58. Gimpli l Ties of cipher alphabets with variants ,—a. The mat¬ 
rices shown on the next page provide some of the simpler means for 
accomplishing monoalphabetic substitution with variants. The systems 
incorporating these matrices axe extensions of the basic ideas of multi- 
litcxal substitution treated in par 52 The variant equivalents for 
any plaintext letter maj be chosen at will, thus, m Fig. 26, Ep=10, 15, 
60 , 01 65, m Fig. 27 , Fp=AU c , r S/ c , FU C , FZ C , LU C , 01 LZ C , etc 


Cl. the title of the picceding section, "tultiljleral substitution 
with single-equivalent cipher alphabets," „ 

— . f 1 !** n' - ** (XT *" 


M *«* — 
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Figure 2 6 


Figure 27 


'Figure 28 
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Figure 29 


Figure 30 


Figure 31 


7 4 

8 5 
9 6 


123 4 56789# 


123^56789 


1 

ABCDEFGHIJ 

7 4 1 

ABCDEFGHI 

2 

KLMNOPQRST 

852 

JKLMNOPQR 

3 

U V W X Y Z‘. , , 

963 

STUVWXYZ* 


Figure 32 


Figure 33 




12345 6 7 8 9 


12345 6 7 8 9 


5 l 

ABCDEFGHI 

0 8 5 1 

TERMINALS 

6 2 

JKLMNOPQR 

9 6 2 

BCDFGHJKO 

7 3 

STUVWXYZ1 

r 7 3 

PQUVWXYZ1 

8 k 

2 3 4 5 6 7 8 9?. 

h 

23456789? 


Figure 311- 


Figure 35 
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b. It is to "be noted that encipherment by meons of the matrices m 
FIguips 27# 28, and 33 is commutative , i.e,, he coordinates may he road 
in either row~col$upn or column-rcju order witlijut cryptographic ombigui ty, 
since I here i3 no 'duplication "between the ro-vf cyp.d column coordinates. 

The remaining matrices above are non-commutgitive , therefore a convention 
must he ogieed upon as to the older of reading the coordinates. It should 
also he noted that in Figs. 30 and 31 the letters in the square have been 
inscribed in such a manner that# coupled with the pariicular arrangement 
of the raw and column coordinates, the number of variants available for 
each plaintext letter is ^roughly proportional io the frequencies of the 
letters in plain text A sunilir idea ib found in Fjg. 35# vbeiem the 
top row of the rectangle contains a woid composed of high-irequency let¬ 
ters# and the coord] nates are arranged in a manner' roughly corresponding 
to the frequencies of plaintext letters. The matrix In Fig. 28 is a mod¬ 
ification of the pseudo-code system described m par. 52h, with the added 
feature of variants. ^ A , 

, jc• Other simple ideas’ for producing variant systems are matrices 
such as the following * » 1 


ABCDEFG1I I-J KLIiHOPQRffTUVWXYZ 


08 09 10 U r 32 13 11 15 16 J-7 38 J9 20 21 22 23 2b 25 01 02 03 Oil- 05 06 07 
35 36 37 38 39 bo Hi 42 >13 44 I15 !t6 U7 48 49 50 26 27 28 29 30 31 32 33 3b 
68 69 70 71 72 73 7*1 75 51 52 53 5*i 55 56 5f 58 50 60 61 62 63 6b 65 66 67 
87 88 89 90 91 92 93 95 96 97 98 99 00 76 77 78 79 80 81 82 83 84 85 86 


Figuie 36 


A B G D E F 'G H ,1 J IC L’ II II 0 P O '"if S T U V U X I Z‘ 


M J 

T U 



14 15 16 17 10 19 20 21 22. 23 24 25 26 01 02 03 04 05 06 07 08 09 10 11 12 13 
27 28 29 30 31 32 33 3 it 35 t 36 37 38 39 *i0 Hi 42 43 44 45 46 47 48 49 50 51 52 
58 59 60 61 62 63 Ok 65 66 67 68 69 70 71 72 73 (b 75 76 77 78 53 5*1 55 56 57 
81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 9 / 98 99 00 'M^S/////////A 79 80 


\ 

K Figure 37 i\ 

In these two m tuces there has been a regular inscription oi the dmomes 
m the 1QVP. 1 1 thermore, m FiG 36 the dmomes 01, 26, 51# and 76 (1 e., 

the lowest number in each of the foul sequences) give the hey woid (TRIP) 
foi that matrix, and iq Fag 37# the dmomes 01, 27# 53# end 79 denote 
the ley woid (NAVY) for that natri v The securny of systems involving 
such matrices would of course be greatly iirpioved 1 the dmores we.re 
assigned m a random mannei but then the easy mnemonic ieature of the 
foul sequences and the lev woid would be lost. 
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a. ma LntcrcsLing adaptation in = a, disc form of bho type of matrix 


illustrated in FLg. 37 is the follow uig device reputedly once used "by 
the Mexican Aimy~ * ‘ ' * ' * \ 


4 



v u 
1 r 


“i 
IU- . 




i - VX » t i ifJl _ 1 

The device consisted of five-concentric discs, the outer disc hearing the 
26 letters of the alphabet, and the other four hearing the sequences 
01-26, 27-52, 53-78, and 79-00. The rotatable discs made it possible to 
change the keys at frequent intervals, without the necessity of writing 
out a new matrix each time *»*- ’ 


99 More complicated types of cipher alphabets with variants .— 
a. Matrices sUch as those in Figs. 38, 39and 46 below are termed 
frequential matrices , si ice the hurnbei” of cipher values available for any 
given plaintext letter closely approximates its relative plaintext fre- 
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Figure 39 
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h. In the fragmentary matrix illustrated in Fig. 38, the number of 
occurrences of a particular letter within the matrix is proportional to 
its frequency in plain text, the letters are inscribed in a random manner, 
in order to enhance further the security of the system. In Fig. 39* tre 
have a modification of the idea set foith in Fig. 38* except that the 
size of the matrix has heen reduced from 26 <26 to 10x10, in this case, 
the letters (with appropnate number of lepetitions) have been inscribed 
in a simple diagonal route (lower left to upper right) within the square, 
and the coordinates have been scrambled, for greater security. In Fig. 4o 
there is illustrated a type of cipher square which is known in crypto¬ 
logic literature as the Grandpre cipher , in this square there are in¬ 
scribed ten 10-letter words containing all the letters of the alphabet in 
their approximate plaintext frequencies These ten words are further 
linked together by a 10-letter word which appears vertically in the first 
column, as a mnemonic feature for the inscription of the words in the 
rows • • • -.»•*■*» rt. iiiti 

c. The frequential-type system represented in Fig. 4la (enciphering 
matrix) and 4lh (deciphering matrix) was described by Sacco?, who pro¬ 
posed that the dinomes inscribed m the enciphering matrix be thoroughly 
disarranged by applying a'double transposition to the dinomes 00-99 as a 
means of suppressing ofiy patent relationships among the variant values 
for the variouc plainte'ct letters, furthermore,'the nulls incorporated in 
the matiix were to be used "occasionally during the encryption of a mes¬ 
sage, m order to throw a cryptanalyst off tlie irack In this example the 
number of yariant vplues for each plainte 't letter has been established, 
of course,' from the standpoint of Italian letter frequencies. 

\r m jl . 4 , 

. ' r - i «-.««* , 

*3 * » i ‘ . 

Sacco, Generale Luigi, Ifonuale di Cnttografia, 3d Ed., Rome, 19'1-7 j 
p. 22. * ir* “ ~ ' 
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Nulls 

J18-56 

21-09 

76-54 

42-12 

01-74 

55-l , i 

83-90 

63-06 

47-45 



13 

I 

03-25 

10-35 

10-23 

52-62 

37-65 

53-75 

79-69 

71 -78 

82-87 


0 

07-30 

51-67 

72-89 


Q 

20 

1 ( 

V 

02-06 

R 

u 

26-9l|- 

95 

HE9 

T 

Y 

33-88 

22 

U 

Z 

00-15 

34 

36-99 

03 

59 


seven 

U6 


eight 

29 


Tive period 
60-91 l6-6l 


figure 4la 
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d. The Baconian cipher described in par. 53a may be used as a bosis 
for superimposing, additional complexities For instance, the "a" elements 
may be represented by any one of the 20 consonants os variants, while 
the "b" elements may be represented by any one of the six vowels, or the 
letters A-M may be used to lepresent the "a" elements and the letters 
N-Z for the "b" elements, digits may be used for the "a” and "b" elements, 
either on the basis of the first five and last five digits, or on the 
basis of the odd and even digits, or the first 10 consonants (B-M) and 
the last 10 consonants (N-Z) may be used for the "a" and "b" elements, 
with the vowels used occasionally as nulls--thus the resultant crypto-_ , 
grams will resemble those of a fairly complex cryptosystem However, * 
once the cryptanalyst assumes the possibility of such a system, its com¬ 
plexity is more apparent than leal Similarly, variations of this genre 
maybe superimposed on triliteral systems such as the Trithemian'cipher 
illustrated In par. 53b; valiants for the "1", "2", and "3" elements may 
be chosen m such a way as to provide a large number of equivalents for 
each basic triliteral combination 

£. Another scheme for e complex variant system is a summing-trinome 
system . In this cryptosystem, each plaintext letter is assigned a unique 
value of 1 to 26 , this value is then expiessed as a tiinome, the digits 
of which sum to the designated value of the letter . For example, if a 
letter has been*assigned the value "4", it may be represented by any one 
of the following permutations and combinations 1 ' 


oo 4 

031 

112 

202 

301 


013 

040 

121 

211 

310 


022 

103 

130 

220 

400 



Since the values toward the middle of the lange 1-26 maybe represented 
by a very considerable number of summing-trmomes (e.g., for the values 
13 and 14 there are 75 variants each), such a system would offer a crypto¬ 
grapher wide latitude m the choice of cipher equivalents in enciphering. 


" v - 




r 


* The representations of an integer (i.e., a whole number) as the sum 
of integers in all possible ways are termed thg partitions of that num¬ 
ber. The partitions in bhi3 subparagraph are mod 10 and also include the 
digit (f) in order to form tnnome equivalents out of^ all the possible per¬ 
mutations. ^ L ~ , j __ 
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especifilly if the "basic values *of the plaintext letters were chosen to 
correspond with the scale of their relative frequencies, such as the fol¬ 
lowing ' 

jqBVYUFHDI ONSTHA.SL CPMGVXKZ 
0133456789 1011121314151617 1819 20312233 34 25 36 27 

gg.% €U g g g^A mmgmmm'm 
1 g g m % g m % g w n g m 

gg g ngmggmgw g- 

, -An £ & g j? % Cp g gggm = 

■ggggg g g gwmm 

— g g g g g g gg g gg g~~ 

' £C£r#*"». 

-if j v , s ggmg g\gA ss^ 

' . - h -art t pi ^ ' f= H ?=• I? 

~*-Si g g g £kgg 


i\> 't y t | g ^ = 

f" /*r - ,r ^ 

. y ^ 

J - VX ’5 " r ' ' -JtW;. 

r 'T - r » t* - t i / %_ jfftf 


! *'**'*' 


J » ' 


/w ffit 'Tt Jb 


Jt l - 1 


^ - 


/ • 


The tallies beneath each value represent the number of variants possible 
for the particular value Th<= unused values for 0 and 27 (uniquely re¬ 
presented by 000 and 999 > respectively) may be used for punctuation marks, 
nulls, or other speclal-pupf>ose symbolsT Since such a system, once sus¬ 
pected, would offer litt]^ difficulty^ to a cryptanalyst, certain modifi¬ 
cations would be necessary m order Jo pose any real obstacles in the way 
of solution. For instance, if the numerical value of a letter is expres¬ 
sed by permutations of 3_ letters (indeed of digits) out of a set of the 
10 letters A-J wherein the sequence o? the letters A-J represents a dis ¬ 
arranged sequence of the digits $-9> such a system may be among the most 
complex types of ciphers,m the realm of monoalphabetic substitution, 
requiring the solution of many simultaneous equations. A further refine¬ 
ment would involve the use of all 26 j.eJters as variants, m predetermined 
groups, to represent the digits ^-9 Fortunately for the cryptanalyst, 
such systems are impracticable for -fi&ld military use, but if they were 
encountered, a sufficiently large volume of text, coupled with Hitt’s 
four essentials quoted m Section I, vpuli eventually make a solution 
possible. Thei actual cryptanalytic complexity of certain apparently 
exceedingly cqnrplex cryptosystems is dep/ndent on their being correctly 
used at all times, which is not invariably the case with military ciphers. 


5 * - , 

J The solution would involve simply dividing the cipher text into 

groups of 3 digits, summing the trinomes thus produced to yield 28 pos¬ 
sible basic values, and solving these basic values as m any simple mono- 
alphabetic substitution cipher. 
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60. Analysis of simple examples .—a. The following cryptogram is 


available 

for 

study 
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z 
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F 

D 

M 

L 

T 

B 

P 
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L 

M 

1) 
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V 

F 
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- 
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The first thing that strikes the v eye is the total absence of vowels, re¬ 
markable not only because six letters are misiTing (cf. the A test) in a 
text of this size, but also becauae~al] six of these letters fall into 
an identical limited category—a significant non-random phenomenon. Since 
a uniliteral substitution alphabet with six letters missing is highly 
improbable, the conclusion of multiliteral substitution is obvious. 

Upon closer inspection it is found that, if the cipher text is divided 
into pairs of letters, only ten consonants (BDGJL¥QSYX) are used 
as prefixes, and the remaining ten consonants (CFHKMPRTWZ) are 
used as suffixes—thus the biliteral (and bipartite ) characteristics of 
the cipher text are disclosed. A digraphic° distribution is therefore 
constructed ' *- ■» j " 

•V J yj J 1 . * - r " 1 -* I ** * ? T - -t 

1 4 CFHKMPRTWZ ' • * 


B 

D 

G 

J 

L 

N 

Q 

S 

V 

X 


i r> '**■ T 


_ "i 

rtf r - 


\ If it had not been noticed that the cryptogram should be divided 
into, pairs for analysis, a biliteral distribution (see par. 23 d) might 
have^heen made, in order to reveal contact affinities of the cipher let¬ 
ters. “ 1 -*** ,f * 


.2 


• RESTRICTED 


130 








REF ID:A56892 


RESTRICTED 


b. It is possible that the cryptogram under study may involve Ihc 
use of a small encipliei xng matrix vith_varlants for the rows and columns. 
Since there is available an easily-applied special solution winch permits 
the detoxmijaatIon of the rojr indicators yhich are equivalent (i.e., inter¬ 
changeable variants) and the column indicators which are equivalent, mere- 
ly from a study of the digraphic distribution, this possibility xs examin¬ 
ed. The special solution is based on the following considerations, m a 
message of moderate length for such a cryptosystem, it may be assumed 
that the various possible'cipher pairs for a given plain-text letter will 
be used with approximately equal frequency, for this reason, the cipher 
letters which pair with one of the letters used to indicate any particular 
row of the enciphering matrix may be expected to pair equally often with 
any other cipher letter which has been,uged to indicate the same row (and, 
of course, the same is true concerning the co]u mn-indicator letter^'I . 

Thus, in the digraphic distribution of such a cryptogram, sets of runs 
appear which have similar "profiles" and, likewise, sets of similar col¬ 
umns .7 First a study will be made of the rows of the distribution t just 
compiled, in ah attempt to locate and isolate those which match with each 
other, then, the"same will be done with the’columns of the distribution. 

c. It is noted that the "L" and "V" distributions have pronounced 
similarities (Fig. 42a)—these rows came under consideration first because 
of their unique "heaviness" of their frequency characteristics. Likewise, 
the "D" and "N" rows have,homologous attributes m their appearance (Fig. 
42b). However, the further grouping of the rows by ocular inspection may 
present difficulties to the student, since b ■* may not yet trust his eye 






Figure 42b. 


- 5 ^ ** 

in matching distributions, and he may feel„the need for some kind of 
statistical assurance. In the following subparagraphs there is given the 
technique of a more precise method for matching, rriathemat ic al m nature. 


7 - 

1 These similarities are especially pronounced when the encipherer 
uses a "check-off" procedure for choosing his variants for each letter, 
that is, when he systematically "checks off" the variants used during 
encryption to msuie that all possible variants are used_xn approximately 
equal proportions. ~ ‘ 
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* A. This method of* matching in an attempt to “equate* 1 interchangeable 
variants involves computing a separate value for each trial matching Of'a 
particular row (or column) against each of a series of other rows (or 
columns, as appropriate)—such a value is taken as an indication of the 
“goodness of match" exhibited by the particular trial, the theory being 
that the correct match will produce the highest value.” The value fofc a 
particulai trial match is computed by multiplying the number of tallies 
in each cell of one row (or column) by the number of tallies in each cor¬ 
responding cell in the other row (or column) end then totaling the pro¬ 
ducts thus obtained. Because of the way in which it is produced, such a 
value is termed a "cross-products sum". - - - - 

- - —— ^ 'i — rv i 

e. In subparagraph^ above, it was determined thft the "Jr and "V" 
rows’were equivalent and^that the "D“ and "N" rows,also formed an equiv¬ 
alent pair .* The next ‘ ,f heavy" row is the "G" row, this’ is to be tested 
for match with the five remaining unmatched rows. Let t}ie ""G“ row be 
tested first against the row Inese two rows are given below, with 
their cross-products sum. For convenience, the cross-products sum is 
symbolized by #(©3-, ©2), where ©1 and ©2 represent the designators of the 
distributions to be''matched 9 


“G" 

t»B” 

*(G,B) 


222-3--1-1 
3111322121 
6 2 2 - 3 - - 1 - 1 


15 


- J 


The complete table pf the comparisons of the "G" row with the five avail¬ 
able lows is as follows 


%(G,B) 

*(G,Q) 

J(G,S) 

*(G,X) 


6 2 2 
2 2 2 

- 4 - 
2 k 4 

- 2 - 


3 

3 

3 

6 

6 


- l = 15 

- 1 = 11 

- - = 7 

- 1 = 17 

- - : 8 


The results indicate that the most probable match with the “G" row is the 
"S" row. 

f. Smcf the next "heaviest" row to be tested is the "B" row, its 
matchings with the .three remaining rows are made, and are given below 


*(B,4 


3111124-121 = 17 
-2 - 2‘ 122-21 = 12 
- 3 - 1 2 *2 2 - 4 - = 12 


«•«?*» I' 

a; 


*r - 


■> r J 
•5*- * 




j-n- r r - — 

° In this connection, note Lhe considerations treated m subpar. 60j. 

^ The Gicek 3ettei % (chi) is often used m cryptoDogy to syii'bolize 
matching operations t 4 t . » - < 


r 
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Tho conccL matching of Iho "B" and "J" roas is indicalcd By llio results. 
This leaves on]y the "Q" and "X" rows, which sic presumed bo {50 together, 
since no-t on3y is their cross-products sum satisfactory (when compared to 
the % values Tor some of tho olhcj rows which have been matched), hut, 
equally important, bhcii patterns o i crests and troughs are simi 3 ar. 

Since ire have not found more than two rows for any one sob of inter¬ 
changeable values, IL appears that the original matt 3 x had only five 
rows, with two variants for each row. The rows of tho distribution dia- 
giam are therefore combined m the fo] lowing diagiam 


CFIIKMPRTWZ 



Figure 43 


g. Analysis of the distributions of the columns of Fig. 43 quickly 
reveals that columns "C" and "H" may be matched as a pair, and likewise 


columns F and M , and columns P and R 


In order to decide the 


groupings of the remaining columns, the six possible ^ values are der¬ 
ived. 

Combinations 

KT, WZ 81 + 90 ; 171 

KU, TZ* 113 + 73 = 186 
_ KZ, TU 88 + 83 = 171 


*(K,T)- 

4 

35 

- k2 - = 

81 

x(K,w)- 

4 

49 

- 49 9 = 

113 

£(K,Z) 

4 

35 

- 49 

88 


6 

35 

- 42 - = 

83 ' 


4 

25 

2 42 - = 

73 

£(w,z) 

6 

35 

- 49 - = 

90 


until 


“ ■■ S X J n 

It appears that the proper pairings of the columns are "K n and "W", "T 
and "Z" 


t 


^ h. The groupings of'the cblumns having been determined, the fre¬ 
quency diagram is reduced to its basic 5 x 5 square, and the <b test is 

\ £ ****" * * ** ru m ^ *■ 


**■ A 

* 

C F K” *P (T 
H M W R Z 

BJ 

6 4 5 7 4 

DN 

16 4 1J1 ii 10 

GS 

7 9-13 

rj. W 

3 15 14 17 13 

- OX 

- 6 6-4 - 


«* «V 


<j>p=l 962 

<t r ;1132 

4>o=l670 


taken as further statistical assurance of the* matchings. Although <(t 0 in 
this case does not come up to the best expectations, we feel nevertheless 
that the matching has been carefully and correctly accomplished, and so 


r- 
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the next step is continued with a conversion of the multiliteral text 
into unilileral equivalents, using the following reduction square con¬ 
taining an arbitrary sequence 



C 

F 

K 

P 

T 


H 

11 

W 

R 

Z 

BJ 

A 

B 

C 

D 

E 

DN 

F 

G 

H 

I 

K 

08 

L 

M 

N 

0 

P 

LV 

Q 

R 

S 

T 

U 

QX 

V 

W 

X 

Y 

Z 


The converted cryptogram is now easily solved, using the principles set 
forth in Section VI. The first fifteen letters of the plaintext message 
are found to read "WEATHER FORECAST . and the original enciphering 

matrix is recovered, based on the key word ATMOSPHERIC, as follows 



P 

F 

c 

K 

T 


R 

M 

H 

W 

Z 

LV 

A 

T 

li 

0 

S 

DN 

P 

II 

E 

R 

I 

BJ 

C 

B 

D 

F 

G 

GS 

K 

L 

N 

0 

U 

OX 

V 

y 

X 

Y 

Z 


i. The method of matching rows and columns just described in the 
preceding subparagraphs applies equally well to all the matrices in Figs. 
26-35# and similar variations If m the process of equating indicators 
the cryptanalyst sees that the row indicators are falling into the same 
groupings as the column indicators, he might be able to accelerate the 
equating proceos by taking advantage of this feature alone, as would be 
the case if he had encountered a cryptogram involving a matrix with indi¬ 
cators arranged m a manner similar to that shown m Figs 29 and 30. 
Furthermore, a cryptogram enciphered m a commutative system, wherein 
the equivalents have been taken m row-column and column-row order indis¬ 
criminately, may be recognized as such through a study of the digraphic 
distribution or the cryptogram since the ” <* " row of the distribution i 

will have an appearance similar to the " «< " column, the " ft " row will f 

be similar to the " ft " column, etc, 1 - 0 this matter is discussed further in 
subpar 6ld. , r /» 


10 It is olten convenient to use arbitrary symbols in cryptanalytic 
woilc, to pievcnt confusion with designations of. pc Lyot elements of plain 
text, cipher text, or key (see footnote 1 on page 58)• For this purpose 
Greek letters are often used, Jlor reference, the 2*1 letters of the Greek 
alphabet# and their names are appended in the chait below 

firw ^ t ” * 


A * 

alpha 

e 

£ 

epslion 

I 

u iota 

N nr 

nu 

P 


ro 

<$> 

t 

phi 

13 ft 

be la 

£ 


^ela 

K 

n Lappa 

X i 

XI 

£ 

<r 

° ijina 

X 

% 

chi 

r * 

gorinu 

H 

•*1 

ct i 

A 

^ lambd i 

O c, 

omcron 

T 

X 

tau 

V 

i 


A 5 

dc. LI i 

0 

8 

the La 

AV 

p Ml 

n ir 

P3 

r 

V 

j Loni 

n. 

CO 

omo^a 
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j. 3 1 ^s intpoi 1 ml bo point out that in matching, Hie cryptanalyst 
should begin VLlh the "best" rows or columns—best not only from the 
standpoint of "heaviness" of the distribution, but o]so boob from the 
point of view 5C a distinctive pattern of crests and troughs. If insuf¬ 
ficient text is available to allow equating all the inteichangeable 
cooidltialps of a particular enciphering matrix, it may still be possible 
that a conVcision of tne cipher text by meant of a partially-reduced rc- 
consbiuction*matrix may yield enough ldiomorphic patterns and other data 
to make possible an entry into the text. If the cryptographer hos not 
used a "chcclc-orf" process m enciphering, but instead has favored certain 
equivalents for the various plaintext letters, matching moy not be pos¬ 
sible, nevertheless, an entry into the text may be facilitated in this 
case, because some oi the resultant peaks in the cipher text may bo cor¬ 
rectly identified Furthermore, since no variant system can possibly 
disguise the letters of lov frequency in plain text, their low-frequency 
equivalents in the cipher text may provide possible approaches to solu¬ 
tion (See also subpar. 6le). 

k. In addition bo the method ol solution by matching and combining 
rows and columns of a digiaphic distribution of a mu3tiliteral cipher, 
theie is also the general approach app3icable without exception to any 
variant system. This method, involving the correlation of cipher elements 
suspected to be the equivalents of specific but unknown plaintext letters, 
is treated in detail in paragraphs 6l and 62 . 

l . Systems such ns the 4-level dmome cipher illustrated in Fig. 3^ 

are susceptible to a very easy solution, if the dinomes have been in¬ 
scribed in numerical order ■'diratod. Assuming such a case m a 

specific cryptogram, the first six groups •>!’ vhich are 

68321 09022 4 8 0 "5 7 65111 886 4 8 42036 .. 


a four-part frequency distribution of the entire message, is taken as 
illustrated m Fig. 4-4 below 


, “ ^ __ __ r _ _ 

oi 02 03 04 05 06 07 08 09 10 31 J 2 33 ^ 15 lb 17 18 19 20 2 i 22 23 24 25 

26 27 i8 29 30 31 5 33 34 35 5 37 38 39 *io 4 i 42 43 44 45 46 47 48 49 fo 


51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 1 68 69 70 71 72 73 74 75 


76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 9] 92 93 94 95 96 97 98 99 00 

Figure 44. 
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/ If the student vill bring to bear upon thij problem the principles he 
learned in Section V of i}his text, he will soon realize that what he now 
/ has before him are four simple, monoalphabelie frequency distributions 
similar to those involved in a monoalphabctic substitution cipher using 
i standard alphabets. The realization of this fact/ immediately provides 
the clue to the next step* "fitting each of the/distributions to the 
normal". (See par. 3l)• This cgm be done without diificulty in this 
case (remembering that a £5-letter alphabet is /involved and assuming that 
I and J ore combined) and the following alphabets result 


01—I-J 

2.6—U 

51—N 

76—E 

02—K 

27—V 

52—0 

77—F „ 

03—L 

28—W 

53—P 

78-33" 

04—M 

29—X 

54—Q 

79—H 

05—N 

30—Y 

55—R 

80—I-J 

06—0 

31—Z 

56—S 

81—K 

07—P 

32—A 

57—T 

82—L 

08—Q 

33—B 

58—U 

83—M 

09—R 

34—C 

59—V 

84—N 

10—S 

35—D 

60—W 

85—0 

11—T 

36—E 

61—X 

86—P 

12—U 

37—F 

62—Y 

87—Q 

13—V 

38—G 

63—Z 

88—R 

14—W 

39—H 

64—A 

89—S 

15—X 

40—I-J 

65—B 

90—T 

16—Y 

41—K 

66—C 

91—U 

17—Z 

4 2—L 

67—D 

92—V 

18—A 

43—M 

68—E 

93—W 

19—B 

44—N 

69—F 

94—X 

20—C 

45—0 

70—G 

95—Y 

21—D 

46—P 

71—H 

96—Z 

22—E 

47—Q 

72—I-J 

97—A 

23—F 

48—R 

73—K 

98—B 

24—G 

49—S 

74—L 

99—C 

25—H 

50—T 

75—M 

00—D 


The key word is seen to be JUNE and the beginning of the cryptogram is 
deciphered as "EASTERN ENTRANCE." 

m. If instead of 25-element alphabets, a system such as that in 
Fig. 37 has been used, onl/ a slight modification of the procedure in 
subpaiagiaph j_ would have been necessaiy, i.e., the distributions would 
have had to be consideied on a basis of 26 , and the process of fitting 
the distributions to the normal would have gone on as m the previous 
example„ 
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n. One [tu. Lher application of principles learned 3 n Section V de¬ 
serves to be mentioned here, in connection with the solution of system 0 
3 uch as those of Fig. 36. Let the following short message he considered 
t -4 - - y ^ /. \ - — - ^ 

4 8 2 2 6 8 8 4 2 3 52099 9360 4 76059 05651 

36683 52267 9711 4 54466 76 

c _ a - - " ' ’ * * *; *,; -„ * , 

If it is known that the correspondents have heen using lx variant system 
such as that in Fig. 36, a special solution may he employed in those cases 
wherein there is insufficient cipher text to permit analysis hy the method 
oi fitting the frequency distribution to the normal. Thus, a short cryp¬ 
togram may he solved hy a variation o C the plain-corapon&nt completion 
method described in par. 34 . ^ First, let the cryptogram he copied in 
dinomes, with an indication of the lcve 3 (i.e., the "alphabet") the di- 
nonle vould occupy m the 4 -level matrix, thus 


II ' -- + 

48 22 68 84 23 52 09 99 36 04 76 05 90 56 51 36 68 35 22 67 97 11 45 44 66 76 
2 5 1 ■ ^ 1 n 2 l 4 14 3 3 2 3~2 I 3 4 12 2 3 IT 


- A ■* * * ’•at X - * 

The dinomes belonging to the four levels are as follows 

' - ‘ \ . , “ 1 

(1) 22 23 09 04 05 22 11 * _ 

(2) 48 36 36 35 45 44 

(3) 68 52 56 51 68 67 66 ‘ 

( 4 ) 84 99 76 90 97 76 ’ * *■ • * J 


1 4 „T 

These dinomes are converted into terms of the plain component by setting 
each of the cipher sequences against the’plain component at an arbitrary 
point of coincidence, such'as in the following example 
, - / r 1 


ABC DEFGH 3 I-JKLMN 0 PQ. RSTUVffXTZ 
01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 16 If 18 19 20 21 22 23 24 25 

26 27 28 29 30 31 32 33 34 35 36 37 38 39 l, 0 4 i 42 43 44 45 46 47 48 49 50 

51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 

76 77 78 79 80 81 82 83_84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 00 


(1) 22=W, 

( 2 ) 48 =X, 

( 3 ) 68 =S, 

( 4 ) u 84 = 1 , 

f r 4 


•44 It should be clear to the student that the reason this method can be 
applied in this instance is that both the plain component (ABC.... Z) and 

the cipher co^onent ( 01 , 02 , 03 . 25 , 26 - 50 , 51 - 75 , 76-OO) are knoTm 

sequences (or thus assumed). 


23 =x. 

09=1, 

04 =D, 

05 =E, 

ro 

ro 

% 

11 =L 

36=l, 

36 =L, 

35 =K, 

45 =U, 

44 =T 


52 =B, 

56 =F, 

53 =A, 

68=S, 

67 =R, 

66=Q 

99 =Y, 

76 =A, 

90 =P, 

97 dl, 

76=A 
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o. The plain component sequence is now completed on the letters of 
the four^levels, as follows* - ' 


i* 



1 st level 



2 d level 


3 d level 

i 

w 

4 th level 

w 

X I D 

E 

W 

L 

X 

L 

L 

K U T 

S 

B 

F 

A S 

R 

Q 

I Y A P W 

A 

X 

Y K E 

F 

X 

M 

Y 

M M L 

V 

U 

T 

C 

G 

B 

T 

S 

R 

K Z B Q X 

B 

Y 

HF 

G 

Y 

W 

Z 

W 

N 

M W 

V 

U 

D 

H 

C 

U T 

S 

L A C R Y 

C 

Z 

A M G 

H 

Z 

0 

A 

0 

0 

W X W 

V 

S 

I 

D 

V 

U T 

M B D S Z 

D 

A 

B N H 

I 

A 

P 

B 

p 

P 

0 

Y 

X 

W 

F 

K E 

U 

V 

U 

W C E T A E 

B 

C 0 I 

K B 

Q 

C 

Q Q 

P 

Z 

Y 

X 

G 

L 

F 

X W V 

0 D F U B 

F 

Q 

D P K L 

C 

R 

D 

R 

R 

Q 

A 

Z 

Y 

H M 

G 

Y 

X W 

P E G V C 

G 

DEQL 

M D 

S 

E 

S 

S 

R 

B 

A 

Z 

I 

W 

H 

Z 

Y X 

QFHUD 

H 

E 

FRMlfE 

T 

F 

T 

T 

S 

C 

B 

A K 

0 

I 

A 

Z 

Y 

R 0 I X E 

I 

F 

GSN 

0 

F 

U 

G U, 

,u 

T 

D 

C 

B 

L 

p 

K 

B 

A 

Z 

S H K Y F 

K 

G 

H T 0 

P 

G 

V 

H V 

V 

U 

E 

D 

C 

M 

0 

L 

C 

B 

A 

T I L Z G 

L 

H 

I U P 

Q 

H W 

I 

V W 

V 

F 

E 

D 

N 

R 

M 

D 

C 

B 

UKMjlHM 

I 

K V Q 

R 

I 

X 

K 

X 

X 

w 

G 

F 

E 

0 

S 

N 

E 

D 

C 

V L W B I 

N 

K 

L W R 

S 

K 

Y 

L 

Y 

Y 

X 

H 

G 

F 

P T 

0 

F 

E 

D 

W M 0 G K 

0 

L 

M X S 

T 

L 

Z 

M 

Z 

Z 

Y 

I 

n _ 

0 

Q 

U 

P 

G 

F 

E 

Y W P D L 

P 

M W Y T 

U 

M 

A 

N 

A 

A 

Z 

K 

I 

II 

R 

V 

Q 

H 

G 

F 

Y 0 Q E M 

Q 

N 

0 Z U V 

W 

B 

0 

B 

B 

A 

L 

K 

I 

S 

U 

R 

I 

H 

G 

Z P R F W 

R 

0 

P A V W 

0 

C 

P 

C 

C 

B 

M 

L 

K 

T 

X 

S 

K 

I 

H 

A Q S G 0 

S 

P 

Q B W 

X 

P 

D 

0 

D 

D 

C 

N 

M 

L 

U 

Y T 

L 

K 

I 

BRTHP 

T 

Q R C X 

Y 

Q 

E 

R 

E 

E 

D 

0 

N 

M 

V 

Z 

U 

M 

L 

K 

csuiqu 

R 

S D Y 

Z 

R 

F 

S 

F 

F 

E 

P 

0 

II W 

A 

V 

W 

M 

L 

D T V K R V 

S 

T E Z 

A 

s. 

G 

,T, 

,Q 

G 

F 

Q 

P 

0 

X 

B 

W 

0 

N 

M 

E U W L S 

W 

T 

UFA 

B 

T 

H 

U 

H 

H 

0 

R 

0 

p 

Y 

C 

X 

P 

0 

N 

F V X M T 

X 

U 

V*G B 

C 

u 

I 

v; 

'i 

I R 

S 

R 

0 

Z 

D 

Y 

Q 

P 

0 

G W Y W U 

Y 

V 

W H C 

D 

V 

K 

w 

K 

K 

I 

T 

S 

R 

A E 

Z 

R 

Q 

P 

H X Z 0 V 

Z 


It is seen that the generatrices with the best assortment- 1 ^ of high- 
frequency letters for the four levels are 

» « 1 

1 st level 2d level 3d level 4 th level 


EIRHBl '3 


RE.E DOW EDSWEDC 


N C E T A L 


In evaluating generatrices, the sun of the arithmetical frequencies 
of the letters in each row may be used as an indication of their relative 
"goodness". A statistical 3 \ much more accurote method ol evaluating 
generatrices involves the use of logarithms of tlie probabilities of the 
plaintext letters fornlng uhe general!ices. (bee also footnote 7 on 
page 89 ) ” 
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IT tlie lettcis of these generati1ces aie ananged xn the oidox or appear¬ 
ance of tlieir dlnomc equivalents, according to Uie vay they fall in bo the 
various levels, 

48 22 68 8>i 23 52 09 99 3 § 04 H 05 90 56 51 36 68 35 22 67 97 11 jj£ 4 ji 66 7 6 

E PR MW E T 

R E ED OR 

E 0 SR E D C 

N - C E T A E 

the plain text "I&EW 0 P,CEMENTS HEEDED AT ONCE” is clearly seen. Or, ihore 
simply, iT vc examine the equrvalenls of 01, 26, 51, and 76 after the 
generatrix determination has been made, the key word JURE is levealed. 

If an error had been made m the selection of a generatux, the error 
could he resolved hy hypothesizing the probable key word, or by decipher¬ 
ing the text on the has os of the assumed diagiam and then noting and 
degarbling the systematic errors (which, it would be noticed, all come 
from one level) 

The student should note that no one generatrix will yield plain 
text all the way across as in the example m par, 34 Instead, the - 
generatrices must be considered separately for the four ]eveIs, since it 
is within each of the four levels that there is a homogeneous relation¬ 
ship of dinopies Obviously if dinomes from more than one level were used 
to complete the plain component sequence, the generatrices vould not con¬ 
sist of a homogeneous group of letters but instead would represent an 
assortment of letters'from two or more "alphabets" 

6 l, Analysis of moie complicated examples .—-a As soon as a begin¬ 

ner in cryptography realizes the consequences of the fact that letters 
are used with greatly varying frequencies in normal plain text, a bril¬ 
liant idea very speedily comes to him Why not disguise the natural fre¬ 
quencies of letters by a system of substitution,using many equivalents, 
and let the numbers of equivalents assigned to the various letters he 
more or less m direct propoi_Lio.fi lo tne noimal frequencies of the let¬ 
ters’ Let E, for example, have 13 equivalents, T, 9 , N, 8, etc., and 
thus (he thinks) the enemy cryptanalyst can have nothing in the way of 
telltale or characteristic frequencies to use as an entering wedge 

b. If the ted available for study ig smill in amount and if the 
variant values are wholly independent of one aftother, the problem can 
become exceedingly difficult. But m practical military communications 
such methods are rarely encountered, because the volume ol text is usually 
great enough to permit of the establishment of equivalent values . To 
illustrate what is meant, suppose a number of cryptograms produced by 
the monoalphabetic-variant method described above show the following 
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Wo sets of gioupinco^ of cipliei elements m the text. Set "A" being 
assumed to be diffeient representations of one particular underlying 
plain text, and Sol "B" assumed to be representations of another under¬ 
lying plain text 

Set "A" Set "B” 


(12-37-02-79-60-13-03-37-7 l ) 

(82-69-02-79-13-68-23-37-35) 

(82-69-53-16-13-13-78 05-35) 
(91-05-02-01-68-42-78-37-77) 


(71-12-02-51-23-05-77) 

(H-82-51-02-03-05-35) 

(11-91-02-02-23-37-35) 
(9 1 -12-51-02-78-69-77) 


An examination of these groupings wou3& lead to the following tentative 
conclusions with legard to piohahle equovnlc its 


(12, 82, 91) 
(05, 37, 69) 


( 02 , 51 ) 

(01, 16, 79) 


(13, 42, 68) 
(03, 23, 78) 


(35, 77) 

(13, 71, 97) 


The establishment of these equivalencies would sooner or 3ater lead to 
the finding of additional sets of equal values. The completeness with 
which this esn he accomplished vilY determine the ease or difficulty of 
solution Of course, if many equivalencies can he established the prob¬ 
lem can then be reduced practically to monoalphabetic terms and a speedy 
solution can be attained 


c. Theoretically, the determination of equivalencies may seem to be 
quite an easy matter, but practically it may be very difficult, because 
the cryptanalyst can nevei be certain that a combination showing what 
may appear to be a variant va3ue is leally such and does not represent 
a part of a different plaintext sequence. For example, tal r e the groups — 


17-82-31-82-14-63, and 
> ' •* - 27-82-40-82-14-63 

• * m ~ 

Here one might suspect tnot 1 7_and 2 J represent the same letter, 31 and 
40 another letter. ^But it ^happens i^hat one §roup represents the word 
MANAGE, the other BA14A.GE. # The*e are^hundreds of such cases in English 
and in other languages , "* * 

- \ f + c 


d. When reversible combinations aie used as variants, the problem 
is perhaps a bit more simple For example, using the accompanying Fig. 45 

t 1 » * 

, - w 


!<%7 
G, J 
C,N 
P,T 

Figuie 45 


K,3 Q,V B.H II,R D,L 


N 

H 

A 

0 

h 

D 

T 

ii 

F 

p 

O 

B 

u 

1 

V 

G 

-ir 

iV 

R 

c 

s 

4-1 

i 

Y 

\r 

IC 


13 ^iic olej t ctvocnc m~ht be able to detei”unc 
te'L of the two bc J s o' cipheitext gioupirgo. 


the uudci 1 ing p3 nn 
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foi encipherment, two messages with the same initial words, REFERENCE 
YOUR, may he enciphered as follows 

_R_ 5 " F 6 ' V ~ E N C ' ^ Y 0 if "E 

(1) N II 17 D R X L S II C DVVZH RSLHF SBB JC H 

(2) CID1IR XSLHN D If Z If U RLSHP R1I JB H 

The experienced cryptanalyst, noting the appearance of the very first few 
cipher groups, assumes that not only have the messages identical beginn¬ 
ings m their p]ain texts, hut also that he is here confronted with a 
variant system involving biliteral reversible equivalents One of the 
manifestations of puch* a cryptosystem is that in the d3graphic distribu¬ 
tion of the cipher text the "B" row will have an appearance similar to 
the "B" column, the h C” row will resemble the “C" column^ etc., thus, 
the cryptanalyst will almost immediately realize that fie has encountered 
a commutative system involving a matux smaller than that indicated by 
the size of matrix necessary for malting the digraphic distribution. 

* l ' » i *j ^ ~ 

e The probable-word method of solution may be used, but* with a 
slight variation introduced because of the fact that, regardless of the 
system, letters of low frequency in plain text remain infrequent in the 
ciyptogram . Hence, suppose a word containing low-frequency letters, but 
in itself a rather common word strikingly idiomorphic in character is 
sought as a "probable vord", for example, words such as OAYALRY, ATTACK, 
and PREPARE, Such a vord may be ■(•mitten 6ri a slip of paper and slid one 
interval at a time under the te ct, which has been marked so that the 
high- and low-frequency characters are indicated. Each coincidence of a 
low-frequency letter of the text,with a ^ow-frequency letter of the 
assumed word is examined carefully to see whether the adjacent'text let¬ 
ters correspond in frequency "with the other letters of the assumed word; 
or, if the latter presents repetitions, whether there are correspondences 
between repetitions m the cipher text and those in the word. Many trials 
are necessary but this method will produce results when the difficulties 
are otherwise too much for the cryptana]yst to overcome. 

62. Analysis involving the use of j sologs T--a. In military communi¬ 
cations it is not unusual that cryptograms are produced containing identi¬ 
cal plain text but which have been subjected to different cryptographic 
treatment, thus yielding different cipher texts. This difference in cryp¬ 
tographic treatment may be caused by the use of an entirely different 
general system, or by the use of a different specific key, or merely by 
the choice of equivalents in a variant system. Messages which present 
different encrypted texts but which contain identical plain text are 
called isologs (fiom the Greek iso = "equal" and logos = "word"). One of 
the easily-noted indications of the possible presence of isologs is 
equality or near-equality in the lengths of two (or more) cryptograms. 
Isologs, no matter hear the cryptographic treatment varies, are among the 
most powerful media available to the cryptanalyst for the successful 
solution of a difficult cryptosystem—and, in some cases, may provide the 
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only possible entries itito a complex cryptosystem. An inkling of the 
help afforded by isologs was revealed by the example contained in su^nr. 
6Id above; however, a much more striking Illustration is given in the 
next few subparagraphs. 

b. The following two cryptograms, suspected to be isologs, are 
available for study . » 

Message "A” 


82265 

63103 

7 4 8 3 9 

69842 

3 2 5 2 9 

70115 

8 0 2 7 7 

89106 

9 4 0 0 0 

1 

3 8 2 8 

5 4 0 8 2 

40065 

6 3 6 2 9 

3 3 9 1 8 

4 3 15 8 

8 

10 4 8 

2 6 4 5 8 

4 5 0 3 9 

8 17 13 

5 2 5 3 8 

7 3 3 0 9 

2 

0749 

6 17 5 2 

1 6 4 7 6 

38728 

9 114 7 

9 9 9 2 6 

4 

14 6 8 

1 3 3 6 5 

3 3 8 8 1 

89697 

93816 

5 17 5 0 

5 

7074 

1 1 8 0 4 

4 3 2 5 5 

28120 

2 7 7 3 0 

3 119 9 

7 

9962 

27865 

6 0 6 5 3 

90870 

40867 

46594 

,1 

9855 

10 0 2 2 

22987 

46729 

3 6 2 4 5 - 


* 




i. Vi 

t 


V 





- - - 

Message 

"B" 


- 

30150 

► v 

87497 

14 5 11 

9 

7 3 6 0 

4 9 6 7 6 

50106 

45647 

99181 

69672 

5 

3 8 8 9 

41563 

25203 

90628 . 

7 7 5 3 6 

2 0 3 5 1 

1 

0 5 7 0 

89277 

75011 

3 5 19 9 

9 0 13 8 

9 9 9 7 4 

5 

0 2 3 2 

0 4 1 l 5 

89216 

3 8 4 6 3 

1 7 5 4 7 

1 4 6 4 8 

0 

0 6 4 6 

85864 

5 3 8 9 8 

26121 

83878 

94889 

3 

3 7 2 8 

112 7 2 

20504 

0 6 4 8 l» 

3 2 10 3 

9 8 7 1 5 

4 

2 6 6 2 

80760 

89880 

4 4 10 5 

52900 

5 9 7 2 8 

2 

2 8 5 5 

8 7 3 0 0 

7 0 8 9 3 

59682 

46253 





- 


On the possibility that some dinome system (or systems) is involved, the 
messages are written under each other m dinomes to facilitate the exam¬ 
ination of the similarities and differences of such a grouping of the 
cipher texts, as shown on the next page 
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REaTRICTEB - 


J ' 5 10 15 


A 

82 

26 

56 

31 

03 

7^ 

83 

9<S 

98 

42 

32 

52 

97 

01 

15 

A* 

30 

15 

08 

74 

97 

i4 

51 

19 

73 

60 

>19 

67 

65 

01 

06 

B 

80 

2f 

78 

91 

06 

9M 

00 

01 

38 

28 

5*i 

08 

24 

00 

65 

B’ 

45 

6 I 1 

79 

91 

81 

69,' 

1 

67 

25 

38 

89 

4] 

56 

V m. 

32 

52 

03 - 

C 

63 

62 

93 

39 

18 

43 

15 

88 

10 

'48 

26 

45 

84 

50 

39 

C‘ 

90 

'62 

87 

75 

36 

20 

35 

11 

05 

70 

89 

27 

77 

50 

11 

D 

81 

T1 

35 

25 

38 

73 

30 

92 

07 

49 

61 

75 

21 

64 

76 

D» 

35 

19 

99 

01 

38 

99 

97 

45 

02 

32 

oh 

11 

58 

92 

16 

a 1 

E 

38 

72 

89 

11 

47, 

99 

92 

64 

i4 

68 

13 

36 

53 

38 

8l 

r ' E’ 

38 

46 

31 

75 

47 

l4 

64 

80 

06 

46 

v 

85 

86 

45 

38 

98 

j 

F 

89 

69 

79 

38 

16 

51 

75 

05 

70 

74 

11 

80 

44 

32 

55 

F 1 

26 

12 

18 

38 

78 

94 

88 

93 

37 

28 

11 

27 

22 

05 
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The dxnome distributions for the two messages are as follows• 




Distribution Tor 
Message "A” 


Distribution for 
Message "B" 
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c. Since a geneial absence of marked crests and troughs jls noted in 
"both distributions, if the divis.on of these cryplogiams into dinomes ,^s 
conect, and if they are both monoalphabetxc, xt is quite probable that 1 
some type of valiant system (ox systems) has been used. With this in 
mind, the encrypted te^ts and their distributions ore scrutinized fuxther 
fox some indication oC the kind of relationship which exists between the 
methods of encipherment of the tiro messages. The distributions axe seen 
to be strikingly similar, not only with respect to the location of the 
one predominant peak m each, but a] so m the close correlation of the 
locations of the blanks in each Furthermore, upon examination of the 
superimposed messages themselves, it is observed that there are several 
instances whexein a value m message "A" coincides with the same value 
in message "B" (e.g , see positions A/a 1 l4, b/b' 9) This observation, 
taken in conjunction with the marked similarity of the distributions, 
strongly indicates that not only has the same genexal cryptosystem been 
used for the encryption or both messages, but that the same enciphering 
m atrix has-been used for both. Also, in the case of the values 3$ and 
T>2., it is noted that wherever either occurs in one message the same vaiue 


(For the benefit of the student with a mathematical background, it 
might be interesting to point out certain applJcations of cryptomathe¬ 
matics in connection with these two distribut1ons. First of all, each of 
the two distributions is much flatter than that which would be expected 
for a sample of 125 dunomes of random text, i.e., a drawing (with replace¬ 
ment) and recording from an urn containing equal numbers of counters m 
each of 100 categories labeled 00-99 consecutively. In other words, 
whereas "random” follows a characteristic distributional appearance, ap¬ 
proximated by the normal or binomial distributions, the samples at hand 
exhibit phenomena even flatter (or "worse") than that expected for ran¬ 
dom, appxoaching the theoretical (and fantastically non-random) "equi¬ 
librium" of exactly the same number of tallies m each cell of a distri¬ 
bution. The following table gives the observed number of x-fold repe¬ 
titions in the two distributions, together with the expected number of 
x-fold repetitions in a sample of like size of random text, which expected 
number has been computed from tables of the Poisson exponential distri¬ 
bution (see Military Cryptanalysis, Part III) 


X 

Observed 
llsg. "A" 

Observed 
Msg. "B" 

Eypected 

T 

l4 

37 

29 

1 

51 

52 

36 

2 

33 

23 

22 

3 

1 

6 

9 

ii 

- 

1 

3 

5 

] 

1 

1 


It is to be noted that m the distribution for Iiessage "A" the observed 
number of blanks (l^l) against the expected number cf blanJ s from random 
text (29) 1 cprcsenlb a signnge or standard deviation of 2 70 or , which 
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occurs In the othci mess age, a phenomenon explainable on the 09 sump l ion 
thal the plaintext equivalents of theegjyolues are of such low frequency 
that no variant values have "been provided for these plaintext* letters in 
the ci yptosyb+om. ‘ '* 

d. Uath Lhc foregoing details determined, it is now iooli 7 cd that 
it should he possible bo form, between the two messages, ’'chains” ot 
those cipher values which represent identical plaintext letters, os 
cxemplificd below. Beginning with the first value m each message, 82 
and 30 , a partial chain of equivalent "variants is started, now locating 
some other occurrence of cither value elsewhere (e g., 82 at position H’8), 
and noting the cipher value coinciding with it (in this case, Hi), the 
porlial chain maybe extended (including now 82, 30, ond Hi), After this 
particular chain is extended to include as many values as possible, 
another chain is formed by starting with any value which hoe not already 
been included in the preceding chain, this procedure being repeated until 


can be translated as odds of 368 to 1 qgainst its occurrence by pure 
chance. Likewise the other entries besides 0 (in particular, the x-values 
of 1 and 2, and the pump.] ativp values of 3-and-better) may be evaluated 
in terms of sigmages, and the conclusion would be reached that the two 
distributions have a most remote chance of being as flat as they are 
through mere chance"^* for instance, it is 3*05 o* or 877 to 1 against <jLis- 
' triVution ,f A ft having pnly two "tallies occurring three or svbre times when 
13 such" tallies are expected'by iandoim—and this sigmage when taken into 
consideration with that of the number of blanks yields a sigmage of H <r 
or approximately 31,000 to 1 of occurring through' sheer chance. The sum 
total of all the deviations could be collectively evaluated, but this 
vould involve the laborious computation of a multinomial distribution. 
Since the distributions of the tiro messages are much worse than would 
even be expected for random chance, the conclusion is drawn that the 
dinome grouping is highly significant and therefore must be correct, and 
furthermore that the cryptosystem involves variants in sufficient numbers 
for the plaintext letters to permit the_ encipherer to select the cipher 
equivalents with a yi_cw to suppressing as much of the phenomena of repe¬ 
tition as possible Secondly, the X tost of the* two distributions gives 
a X value of 206 , as against the % value of 156 for_ random samples of 
this size, this represents-a sigmage o£_h.02 <r , or a ratio of 33,000 to 
1 against its happening by pure chance, i.e., if the cryptograms were not 
m the same general, system and specific keys. Therefore it is a foregone 
conclusion stabisticafly that not only f do the cryptosystems involve di- 
nomes as the ciphertext grouping, but that the identical cryptosystem is 
involved m the two messages, and that because of,the close correlation 
o J f~ the patterns of the two distributions, there is a good probability 
that the ‘cryptograms contain identical plain text and therefore are iso- 
logs This specific illustration of the potentialities of cryptomathe- 
matics indicates the important role that this branch of science may play 
in the art of cryptanalysis. 
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all possible chains are completed. It is found that the following chains, 
arbitrarily arranged hcie according to length, may be derived from the 
taro messages 

(06 14 15 26 28 33 35 73 7 l > 8i 89 98 99 ) - . _ 

(02 07 20 22 43 44 6? go) 

(12 37 48 51 69 70 83 9*0 
(03 30 4l 5 l( 65 82 97) 

(05 10 24 32 49 87 93) 

(16 18 36 76 78 79 86 ) 

(27 45 53 64 80 92 ) * - * 

(11 39 75 88 ) 

(21 58 77 84) 

(46 59 68 72 ) 

(00 52 67 ) 

(04 55 61 ) 

(08 29 56 ) 

(19 71 96) , 

(01 25) Single dinomes 

(13 85 ) ^ - 

(42 60 ) ( 38 ) ( 47 ) ( 50 ) ( 62 ) (91) 


If we now make an arbitrary assignment of a different letter to represent 
each chain (and each single dinome) and convert either of the messages to 
uniliteral terms by means or these arbitrarily-assigned values, we note 
the pattern of the opeping stereotype "REFERENCE YOUR MESSAGE*.*.*", and 
quickly recover the plain text, 

e. The plaintext values when inserted into a 10x10 matrix having 
arbitrarily-arranged coordinates yield the following, 

- (Hr, _ t ' 

0123 4 56789 

0 

1 

2 

"3 

1 ) 

5 

6 

♦ 

7 

8 

_ 9 

Manipulating the,, rows and columns t ith a view to uncovering some symmetry 
or systematic phenomena, the latent diagonal pattern of tl^equivalents 
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for Certain'or the ]otters (such as E p , Up, Op, ftp, end G p ) 3S revealed, 
and tlio'roi/4'and coiumns of bhc reconstruction djognm arc, permuted to 

yield the'Toll owing original enciphering matiit 

- s- 1 > * , - *t v - ~ - 
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l - _ t»* • 
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f * « r 8 
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There are^pnq observable relationships in or between the sequences of_ 

digits hi‘ the row and cplunjp. coordinates, therefore for want of any 

visible phenomena or further information on the derivation (if any) of 

these digits, it. is assumeef thjit they must have been assigned at random. 

The student vill note that final matrix is identical to that of 

Figure 39 in paragraph 59• ' 

— ■» 

f. It should be emphasised that in the example of the preceding 
subparagraphs it was only possibleJLo form chains of values from both 
messages reciprocally because the same’ encipnermg matrix had been used 
for both. A non-reciproca j chaining procedure voul<3 have been required 
if only the general system had been the same for both but the enciphering 
matrices had differed in some respect, or if tiro completely different 
variant systems had been usea (eg, one using a frequential matrix and 
the other involving a less Complex type of variant matrix, such as Fig. 
29). Specifically, it would'have been necessary to maintain two sepa¬ 
rate groups of chains, one'group for,each message, otherwise heterogeneous 
values would have become intermingled. ’ 

* » ■ x ~ t r 

g. Although an analysis of but one isolated example by means of 
isologs was presented, the stqdent should b^able to appreciate the 
significance and potentially enormous value oT isologs to a cryptanalyst. 
This value goes far beyond the simple variant encryption in a monoalpha- 
betic substitution system, isologs produced by the use of tiro different 
code books, or two different enciphered code versions of the same under¬ 
lying plain text, or tiro encryptions of identical p]ain text by tiro dif¬ 
ferent "settings’ 1 of a cipher machine, may ail prove of inestimable value 
^Ln the attack on a difficult cryptosystem 
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63» Further remarks on variant systems .—a. A few yords should bfi 
added wxOT regard! to certain subtei fuges Which are sometimes encountered 
In monoalphabetic substitution with variants, and which, if not recog-^ 
ifLzed in time, cause considerable delays The considerations treated f 
before in subpars. 521 and on the disguise of the length of the basing ( 
multiliteral group apply equally here to multiliteral substitution with 
variants, thus, in dmorac systems, a sum-checking digit or a null might 
be added m specified positions of the group to form a trinome. In 1X0 ' 
complex valiant systems, the presence of a null as one of the digits of 
a trinome would, add greatly to the complexities of cryptanalysis of that 
system. The most important of the subterfuges have to deal with the use 
of nulls which are of a different size than the real cryptographic units, 
inserted occasionally to prevent the cryptanalyst from breaking up the 
text into its proper units The student should take careful note of the 
last phrase, the mere insertion of symbols having the same characteristics 
as the symbols of the cryptographic text, except that they have no mean¬ 
ing, is not what is meant JThis class of nulls rarely achieves the pur¬ 
pose intended. Mhat is really meant - can best be explainedJby an example. 
Suppose that a 5 x 5 variant matrix vith the row en£L column'indicators 
shown in Fig. 46 is adopted for encipherment, formally, the cipher tuilts 
would consist of 2-letter combinsLions of the indicators, invariably 
giving the row indicator first (by agreement). * w m 
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'Figure J i6 

1 

The phrase CQUHAirm'Jt OF SPECIAL TROOPfa might be enciphered thus* 


COMMA I'DBROF 
VI EB PH IU FT F 1 ab Til UO Pi I GT 


• • * 

• • • 


These wuld normally then le amonged m 5“letter groups, thus 

v i e n p 111 u n* t 1 r <ibt 11 t j 0 r T r g t . . . 
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V> lh ill 1 k> no+ril* however, Ihil only 20 c»I Ihe 26 li.Itor , of the 
alphal*. 1- Iwvq T»oei» employed as r kst infl r olimrn. indicator,, li-wm' J, Y , 

0, \, nn»l * wni oil Tow, « , w(ppo* , o Uk c r i lei l or ore used ss nul Is, 

not ~m T>-Uls i ]m n ; ~ 1 r>tl rtydunl Irller uijcalod at ropflen jujt before 
the i o Vl W '13 nia ouged m S-lefctci gioups Occasionally, *t pair of 
letters wngjhl he ,nL"ci lQd,_ln otrde^ to mash the charocteiistics of 
"avoidance" of llic c 3elteis ftn each other. Thus, for example. 

VIEXB J'KKIU F J X T I EAJBT MU OOP U G K T Y 

Tile cryptanalyst, a^tcr some study suspecting abilitcral cipher, pro¬ 
ceeds to break up Llic* text into pairs 

VI EX BP HR. H) FJ XT IE_ AJ BT HU 0Q IU GK TY 

Compare thJs sel or 2~lctter u comb3notaons mlh the correct set. Only 4 
of the 15 pairs ore "propei" units It is easy to see thal without a 
knowledge of the existence of the nulls—ond even with a knowledge, if 
he does not lorsr which letters ore nulls—the cryptanalyst would be con¬ 
fronted with a pioblem for the solution of vhich a Tairly large amount of 
text might be necessary The coreful_eirployment oJ the variants also 
very materially adds to the security of the method because repetitions 
can be rather effectively suppressed 

£. Similarly m the examples undci parograph 58, the letter J in 
FigB. 27 and 29 mey be used cs o null, the letter Y in Fag. 28 , and the 
digit 0 in Figs 33 and 3 l > In Fag 30, any Ictuers m the range of 
P - Z might be used as nullB, out this U3a'e ought be weak because of the 
extremely low frequency of these letters as compared with the letters 
A - 0, this is an important point to consider in the examination of en¬ 
crypted text for possible poor usages of nulls 

d. From the cryptographic standpoint, usage of nulls in the manner 
outlined above result, in crypto^r ipmc tod even more than twice as long 
as the plain text, llius constj lutan^ a serious disadvantage. From the 
cryptanalytic standpoint-, Ihe m -an of the cipher units m the system 
described in subpar b above cons Ii lute, the most important obstacle to 
solution, this, coupled nlh tne u ■> of vara ants, makes this system con¬ 
siderably more difficult to solie, espite its monoalphobeticity. 
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SECTION IX 

POLYGRAPHIC SUBSTITUTION SYSTEMS 


Paragraph 

General remarks on polygraphic substitution... 6k 

Polygraphic substitution methods employing large tables.. 65 

Polygraphic substitution methods employing small matrices........... 66 

Methods for recognizing polygraphic substitution..,... 67 

General procedure in the identification and analysis of polygraphic 

substitution ciphers....... 68 

Analysis of four-square matrix systems.....* 69 

Analysis of two-square matrix systems.... 70 

Analysis of Playfair cipher systems.... 71 


Analysis of pojLygraphic systems involving large tables, 


tution systems 
sisting of sink 
tinction betwe 2 
elements const!' 
volving single 
involving ciphfe 
multiliteral.1 


*'triliteral ri , 


72 


Further remarks on polygraphic substitution systems... 73 

6k. General remarks on polygraphic substitution .—a. The substi- 


dealt with thus far have involved plaintext units con- 
le elements (usually single letters). The major dis- 
n them has been made simply on the basis of the number of 
tuting the ciphertext •units of each, i.e., those in- 
|-eleinent ciphertext units were termed uni literal , and those 
rtext units composed of two or more elements were termed 
That is to say, when the terras "uniliteral", "biliteral", 
fete., were used, it was to have been automatically Inferred 
that the plaintext units were composed of single elements. 


b. This 
involving plai 
systems are te 
terms "unilite 
rectly so—tha 
the compositio 
term containing 


section of the text will deal with substitution systems 
atext units composed of more than one element; such 
rmed polygraphic .2 (By comparing this new term with the 
ral" and "multiliteral" it may then be deduced—and cor- 
a term involving the suffix "-literal" is descriptive of 
of the cipher text units of a cryptosystem, and that a 
the suffix "-graphic" describes the composition of the 


See also 


subpar. 52a. 


Systems 
on this basis, 
connection witjh 
units of a syst 
unless otherwis 
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involving plaintext units composed of single elements may, 
be termed monographic; however, as has been stated in 
the terms "uniliteral" and "multiliteral", the plaintext 
em are understood (without restatement) to be monographic 
e specified. 
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p]aintexfc units. ) Polygrapnic systems in whjch the plaintext units are 
conposed of two elements axe called Aigiaphic , these in which the plain¬ 
text units are computed of three elements sae trigraphic, etc. The 
ci p hertext units of polygraphic systems usually consist of the same 
number of elements as the plaintext units. Thun, if a system is called 
"digraphic", it may ho assumed that the ciphertext units of the system 
consist of two elements, as do the plaintext units; if this were not the 
caf>e, the terra "digraph.! c" "by Itself would not he adeautte to describe 
the system completely, and an additional modifying word or phrase would 
have to he used to Indicate this feet .5 

jc» In polygraphic substitution, the combinations of elements which 
constitute the plaintext units are considered as indivisible compounds. 
The units axe composite in character and the individual elements compos¬ 
ing the units affect the equivalent cipher units jointly , rather than 
separately. The basic important factor in true polygraphic substitution 
ib that all the letters of each plaintext unit participate an the de¬ 
termination of its cipher equivalent, the identity of each element of 
the plaintext unit affects the compos it ion of the whole cipher unit'. 6 
Thus, Id a certain digraphic system, ABp may be enciphered as XP C , and 
AL'p, on the other hand, may be enciphered as NK C ; a difference in the 
identity of but one of the letters of the plaintext pair here produces a 
difference in the identity of both letters of the cipher pair.' 


q 

In this connection, it is further pointed out that since the root 
"literal" derives from the Latin "litera". It is conventionally prefixed 
by modifiers of Latin origin, such as "uni-", "bi-", and "multi-"; simi¬ 
larly, "graphic", deriving from the C-reek "graphikos", is prefixed by 
modifiers of Greek origin, such as "mono-", "di-", and "poly-". 

^ The qualifying adverb '‘usually" Is employed because this corre¬ 
spondence is not essential. For example, if one should draw up a’sot of 
6(6 arbitrary single signs. It would be possible to represent the 2- 
leLter pairs from AA. to ZZ by single symbols. This would still be a &i- 
grapnic system, 

5 Gee subparo» 6 je and 66 f for examples of two such systems and their 
names. " “ — 

^ An analogy is found Jn chemistry, when two elements combine to form 
a molecule, the latter usually having properties quite, different from 
those of either of the constituent elements. For example, sodium, a 
metal, and chlorine, a gas, combine to form sodium chloride, common 
table salt, Howe/er, sodium, and fluorine, also a gas similar in many 
respects to chlorine, combine to for-ji sodium fluoride, which is much 
different from table salt. 

r * 

^ For this reason The two letters arc marked by a ligature, that Is, 
by a bar across tneir tops. 3 n cryptologic notation, the symbol 89^ 
naans "any plaintext digraph", the symbol 83 c , "any ciphertext digraph". 
To refer speeix'cally tc the 1st, 2u, 3d,. . me mber of a ligature, the o 
exponent 1, 2, 3,»»« will be used. Thus 8 ^ of REMp is the letter E, 65 
of XRZ C is Z, See also footnote 1 on 
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d. The fundamental purpose of polygraphio substitution is again 
the suppression or the elimination of the frequency characteristics of 
single letters of plain text, Just as is the case in monoalphabetic 
substitution with variants; hut here this is accomplished by a different 
method, the latter arising from a somewhat different approach to the 
problem involved in producing cryptographic security. When the substi¬ 
tution involves replacement of single letters in a monoalphabetic system, 
even a single cryptogram can be solved rather readily; basically the 
reason for this is that the principles of frequency and bhe laws of 
probability, applied to individual units (single letters) of the plain 
text, have s very good opportunity to manifest themselves. However, 
when the substitution involves replacement of plaintext units composed of 
two or more letters—that is, when the substitution is polygraphic in 
nature—the principles of frequency and laws of probability have a much 
lesser opportunity to manifest themselves. If the substitution Is di¬ 
graphic, then the units are pairs of letters and the normal frequencies 
of plaintext digraphs become of first consideration; if the substitution 
is trigraphic, the units are sets of three letters and the normal fre¬ 
quencies of plaintext trigraphs are involved. In these cases the data 
that can bo employed in the solution are meager; that is why, generally 
speaking, the sSlution of polygraphic substitution ciphers is often ex¬ 
tremely difficult. 

e. By way of example, a given plaintext message of say n letters, 
enciphered by means of a uniliteral substitution system, affords n 
cipher characters, and the same number of cipher units. The same message, 
enciphered digraphically, still affords n cipher characters but only •§- 

cipher units. Statistically speaking, the sample to which the laws of 
probability now are to he applied has been cut in half. Furthermore, 
from the point of view of frequency ;\ the very noticeable diversity in 
the frequencies of individual letters; leading to the marked crests and 
troughs of the uniliteral frequency distribution, is no longer so 
strikingly in evidence in the frequencies of digraphs. Therefore, al¬ 
though digraphic encipherment, for example, simply cuts the cryptographic 
textual units in half, the number of Cipher units which must be identi¬ 
fied has been squared , and the difficulty of solution is not merely 
doubled hut, if a matter of judgment arising from practical experience 
can he expressed or approximated, mathematical] y, squared or cubed. 

f. The following two paragraphs will treat various polygraphic 
substitution methods. The most practical of these methods are digraphic 
in character and for this reason their treatment herein will be more de¬ 
tailed than that of trigraphic methods. 

65. Polygraphic substitution methods employing large tables .— 

a. The simplest method of effecting polygraphic substitution in¬ 
volves the use of tables similar to that shown in Figure 4 Ya. This 
table merely provides equivalents for digraphs, by means of the coordi¬ 
nate system. Specifically, in obtaining the cipher equivalent of any 
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Figure 47a. 


plaintext digraph, the initial letter ef the plaintext digraph is used 
to indicate the row in which the equivalent is found, and the final 
letter of the pla^nteyt digioph indicate the column, the cipher digraph 
is then fouid at tbc intersection of the row and column thus indicated.. 
For example, KGpyFC c> etc. 

b. In the preceding table two nixed sequences were employed to 
form the cipher equivalents, one sectUence being based on the key phrase 
I'lFSTHSSIiOUbH <OCP BRAKE and the other un GENERAL E1ECTRI0 CGHPAHI. The 
table in Figure hj& could have been drawn up in a slightly different 
manner, as shown in Figure 47b, and ’+111 yield the same cipher equiva¬ 
lents as beiore. Using this latter taoli, for any plaintext digraph 

is found at the intersection of Lhc : or and column identified by 6^ and 

p P 

6 !“, respectively, 6 C is found in the sequence below the table and is 
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Figure 47 b. 


p 

taken from the position directly under the column identified by 0 ^. A 

few sample encipherments will illustrate that this table is cryptographi¬ 
cally equivalent to that of Fig. 47 a. 


c. Figures 48 and 49 , below, contain other possible types of tables 
for digraphic substitution. In Fig. 48 , it will be seen that there are 
two vertical sequences to the left of this table and no horizontal 
sequence below it. 0^ is located in the leftmost sequence, 0^ being 

O 

found directly to its side in the right-hand sequence, ©„ is then found 
at the intersection of the row and column identified by ©p and @p. 


3 55 


RESTRICTED 




RE3TIIICTED 


REF ID:A56892 


«*» 

ABCDFFGHIJKLMNOPQRSTUVWjCYZ 






























A 

w 

G 

E 

N 

R 

A 

L 

C 

7 

I 

0 H 

P 

Y 

B 

D 

F 

H 

J 

K 

Q 

S 

U 

V 

vrx 

Z 

B 

E 

E 

N 

R 

A 

L 

C 

? 

I 

0 

P 

p 

¥ 

B 

D 

r 

H 

J 

K 

Q 

s 

U 

V 

ft 

X 

Z 

G 

C 

& 

H 

R 

A 

1 

C 

T 

1 

0 

M 

P 

Y 

B 

D 

r 

h 

J 

K 

Q 

s 

u 

V 

W 

X 

z 

G 

E 

D 

T 

R 

A 

h 

C 

T 

X 

0 

M 

P 

Y 

B 

D 

r 

H 

j 

K 

Q 

s 

u 

V 

w 

X 

Z 

G 

E 

N 

E 

I 

A 

L 

c 

T 

I 

0 

M 

f 

X 

B 

D 

F 

H 

J 

K 

Q 

s 

u 

V 

w 

X 

z 

G 

E 

N 

R 

F 

N 

L 

C 

T 

I 

0 

M 

P 

Y 

B 

D 

F 

H 

J 

K 

Q 

s 

u 

V 

fl 

X 

z 

G 

E 

ft 

R 

A 

G 

G 

C 

T 

X 

0 

M 

P 

Y 

& 

n 

F 

M 

J 

K 

Q 

s 

u 

Y 

w 

X 

7 

G 

r 

N 

R 

A 

L 

H 

H 

T 

I 

0 

w 

P 

Y 

B 

D 

F 

H 

J 

K 

Q 

s 

V 

V 

W 

X 

z 

G 

E 

N 

R 

A 

L 

c 

I 

0 

I 

0 

M 

p 

Y 

B 

D 

F 

H 

J 

K 

Q 

s 

u 

V 

ft 

X 

z 

G 

E 

N 

R 

A 

L 

C 

T 

J 

U 

0 

M 

P 

Y 

B 

D 

F 

H 

J 

A 

Q 

s 

u 

V 

* 

X 

2 

G 

E 

N 

R 

A 

L 

C 

£ 

I 

K 

A 

M 

P 

Y 

B 

D 

I 

H 

J 

K 

Q 

s 

u 

V 

Vi 

A 

z 

G 

E 

ft 

R 

A 

L 

C 

T 

I 

0 

L 

R 

P 

Y 

B 

D 

F 

H 

j 

K 

Q 

s 

u 

V 

w 

X 

Z 

G 

E 

H 

R 

A 

L 

c 

T 

I 

0 

M 

M 

B 

Y 

B 

D 

F 

H 

j 

K 

Q 

s 

u 

V 

$ 

X 

Z 

G 

E 

N 

R 

A 

L 

C 

T 

I 

0 

M 

P 

N 

K 

B 

B 

* 

H 

J 

K 

Q 

s 

u 

V 

w 

X 

z 

G 

E 

N 

R 

A 

L 

C 

T 

I 

0 

M 

P 

Y 

0 

C 

D 

* 

H 

J 

K 

Q 

g 

u 

V 

¥1 

X 

z 

G 

E 

N 

R 

A 

L 

C 

T 

I 

0 

M 

P 

Y 

B 

P 

D 

F 

H 

J 

K 

9 

5 

U 

V 

w 

X 

7 

G 

L 

H 

R 

A 

L 

C 

T 

I 

0 

M 

P 

Y 

B 

D 

Q 

F 

H 

J 

K 

Q 

s 

U 

Y 

w 

X 

z 

G 

E 

N 

R 

A 

L 

G 

T 

I 

0 

M 

P 

Y 

B 

D 

F 

P 

T 

J 

K 

Q 

S 

u 

V 

ft 

X 

z 

G 

E 

N 

R 

A 

L 

C 

T 

I 

0 

M 

P 

Y 

B 

D 

F 

H 

s 

L 

K 

Q 

S 

u 

V 

w 

r 

z 

G 

E 

N 

R 

A 

L 

C 

T 

I 

0 

M 

P 

Y 

B 

D 

F 

H 

J 

T 

M 

Q 

S 

U 

V 

w 

X 

z 

G 

E 

ft 

R 

A 

L 

C 

T 

I 

0 

M 

P 

Y 

B 

D 

F 

H 

J 

K 

U 

P 

S 

U 

V 

w 

X 

»T 

dJ 

G 

E 

N 

R 

A 

L 

C 

T 

I 

0 

M 

P 

Y 

B 

D 

F 

H 

J 

K 

Q 

V 

Q 

U 

V 

ft 

X 

z 

G 

E 

N 

R 

A 

L 

C 

T 

I 

0 

M 

P 

Y 

B 

D 

F 

H 

J 

K 

Q 

S 

W 

V 

V 

¥/ 

X 

z 

G 

F 

» 

R 

A 

h 

C 

T 

I 

0 

M 

P 

Y 

B 

D 

F 

H 

J 

K 

Q 

S 

U 

X 

X 

ft 

Y 

z 

G 

F 

N 

R 

A 

L 

0 

T 

I 

0 

M 

P 

Y 

B 

D 

F 

H 

J 

K 

Q 

S 

U 

V 

Y 

Y 

\ x 

Z 

G 

E 

R 

R 

A 

L 

C 

a 

I 

0 

M 

t» 

Y 

B 

D 

r 

H 

J 

K 

Q 

S 

U 

V 

W 

Z 

Z 

z 

G 

E 

N 

R 

A 

L 

C 

T 

I 

0 

M 

P 

Y 

B 

D 

b 

H 

J 

K 

Q 

S 

U 

V 

w 

X 


Xf 


Figure 48. 

respectively. The tab 3 e in Fig. 4-9 provides digraphic equivalents hy 
mans of the coordinate system (e.g., KCp=JZ c ), in the same manner as In 
Fig. 47 a, end a cursory examination of the inside of the table might 
disclose nothing new about thus table at ail. But, if one were to scan 
closely the diagonals formed by each 0^ from upper right to lowei left, 

he would soe that each such diagonal changes below the "Mp row”, simi- 
larly, if the diagonals formed by 0^ are scanned flom upper left to 

lower light, it will be seen that each of them also changes after the 
"IIp iow u . In effect, the inside of the table is divided into tiro sepa¬ 
rate poj tions by an imaginary line extending horizontally between the 
II and B rows, but within eoch portion a stiaightforward type of symmetry 
is exhibited and the sons* two mixed sequences have been employed in each. 
Actually, in a 26x26 table, it is not possible to maintain the diagonals 
formed thus by 0 q and ©§ in a completely ''unbroken" sequence without 

producing repeated digraphs within the table and without consequent 
cryptographic ambiguity, time, Fig. 49 illustrates one type of limited 
diagonal symmetry vhici raws* be lesorted to in the systematic con¬ 
struction of suen a tab 3 e. 
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d. All of the foregoing tables have exhibited a symmetry in the 
arrangement of their contents, which is undesirable from the standpoint 
of cryptographic security. This systematic internal arrangement could 
be detected by a cryptanalyst early in his attack on cryptograms pro¬ 
duced through their use, permitting rapid reconstruction of the particular 
table involved, this subject will be given a more detailed treatment in 
par. 72. The table in Figure 5° is an example of one type of table which 
would provide more security than the foregoing. This table is con¬ 
structed by random assignment of values and shows no symmetry whatsoever 
in its arrangement of contents. It will be noted that this table is 


rUJL LXi * x r z 


m 

Si 

a 

9 

m 

N 

a 

H 

0 

u 

A 

9 

B 

X 

o 

d“] 

r 

9 

i 

U 

[Fl 

9 

[Fl 

pn 

□i 

\?T\ 


E3 

D 

o 

Ej 

S 

q 

X 

J 

H 

J 

fl 

B 

T 

p 

u 

0 

E 

T 

0 

L 

E 

R 

w 

fl 

n 

□V 

01 

n 

9 

a 

0 

IT 

o 

0 

A 

R 

B 

X 

0 

D 

f 

j 

a 

U 

P 

q 

m 

X 

Y 

9 

X 

El 

El 

E 

E 

1 

JL 

8 

El 

X 

J 

H 

y 

ED 

B 

T 

p 

u 

0 

I 

T 

0 

L 

A 

R 

fl 

E 


01 

a 

m 



0 

9 

A 

a 

B 

r 

ci 

D 

r 

J 

L 

iT' 

P ' 

q 

7 

X 

Y 

Z 

9 

E 

El 


E 


Id 

n 

El 

E 

-S 

fl 

J 

H 

r 

■j 

B 

T 

P 

u 

0 

I 

T 

C 

L 

A 

H 

N 

a 

a 

101 


9 

9 

9 

u 

R 

B 

X 

0 

D 

r 

J 

L " 

U 

p 

q 

V 

3T 

'n 

9 

9 

9 

S 

E 

E 

IE 

E 

E 

E 

B 

E 

s 

.J9u 

X 

J 

H 


D 

B 

T 

p 


0 

I 

T 

fl 

IB 

E 

R 

m 


& 

9 


9 

9 


B 

X 1 

9 

a 

r 

j ^ 

L 

M 

P 

q 

V 

X 

Y 

9 

9 

tm 

a 

T 

El 

E 

E 

E 


B 

E 

Q 

U 

S 

E 

K 

J 

H 

T 

D 

B 

Y 

p 

u 

0 

fl 

fl 

c 

L 

A 

□I 

qS 

H 

9 

|5 

A 

H 

B 

X 

c 

9 

y 

J 

L 

W 

P 


7' 

9 

Y 

z"" 

9 

a 

s 

T 

I 

H 

E 

ti 

B 

E 

z 

X 

W 

7 

u 

E 


X 

J 

E 

F 

■3 

B 

E 

P 

M 

fl 

fl 

T 

C 

L 

m 

01 


9 

9 

91 

B 

9 

9 

D 

9 

j 

a 

9 

9 

Q 

T 

X 

a 

9 

ff 

9 

9 

9 

a 

a 

E 

D 

E 

E 

E 

o 

B 

E 

E 

V 

E 

s 

E 

E 

B 

H 

F 

D 

fl 

H 

P 

fl 

fl 

H 

fl 

fl 

01 

01 

01 

9 

9 

a 

91 

9 

9 

F 

9 

9 

9 

9 

Q 

V 

)T 

Y" 

m 

Lfl 

E 

a 

9 

m 

u 

a 

o 

B 

B 

E 

E 

■3 


E 

E 

W 

E 

fl 

E 

K) 

E 

J 

H 

F 

D 

B 

T 

p 

fl 

fl 

H 

fl 

01 

01 

9 

9 

9 

B 

91 

9 

9 

Efl 

9 

9 

m 


9 

X 

9 


W ‘ 

E 

S 


m 

9 

9 


Ed 

E 

E 

iD 


K3 


fl 

fl 

fl 

B 

fl 

H 

S 

E 

K 

H 

H 

F 

D 

B 

Y 

fl 

19 

fl 

fl 

EH 

9 

9 

9 

9 

B 

D 

9 

9 

L 

9 

P 

Q 

9 

a 

7 

m 

ff 

a 

S 

9 

n 

9 

9 

9 


n 

E 

E 

E 

fl 

B 


IE 

E 

Z 

fl 

W 

V 

E 

E 

JL 

K 

J 

K3 

F 

fl 

fl 

E 

fl 

fl 

fl 

a 

01 

9 

9 

9 

D 

F 

9 

9 

M 

9 

0. 

9 


9 

z 

f -1 

9 

a 

T" 

m 

9 

a 

9 

9 

a 

El 

El 

E 

E 

E 

A 

* 

E 

E 

o 

B 


E 

E 

E 

S 

q 

fl 

B 

H 

H 

fl 

-1 

fl 

B 

fl 

m 

01 

K 

9 

9 

F 

9 

L 

M 

p 

q 

9 

9 

Y-l 

9 

w 

9 

a 

u 

9 

9 

9 

r" 

9 


a 

El 

D 

-1 

E 

E 

Jj 

fl 

R 

N 

E 


E 

E 


E 

E 

fl 


X 

H 

fl 

fl 

» 

fl 

Y 

E 

9 

OR 

o 

9 

F 

9 


U 

P 

9 

r~ 

9 

a 

9 


9 

9 

r 

1“ 

m 

9 

9 

9 

9 

A 

9 

o 

E 

KJ 

B 

T 

E 

L 

A 

R 

E 

E 

E 

z 

E 

w 

E 

fl 

s 

q 

K 

fl 

fl 

fl 

fl 

B 

fl 

a 

a 

9 

9 

Q, 

9 

M 

9 

9 

9 

D 

9 

K 

9 

R 

9 

9 

9 

9 

G 

9 

a 

9 

a 

E 

9 

u 

E 

E 

E 

U 

fl 

D 

E 

E 

fl 

El 

u 

P 

E 

B 

E 

fl 

fl 

H 

X 

fl 

fl 

fl 

fl 

X 

fl 

9 

01 

9 

9 

9 

9 

9 

9 

9 

9 

9 

K 

9 

9 


9 

9 

9 

9 

9 


9 

9 

9 

TT ' 

a 

E 

E 

E 

E 

E 

E 

E 

E 

E 

fl 

E 

0 

K9 

E 

T 

E 

fl 

H 

E 

fl 

fl 

fl 

fl 

fl 

V 

H 

9 

IH 

9 

9 

9 

9 

9 

ui 

9 

9 

9 

B 

m 

a 

9 

9 

9 

9 

9 

m 

9 

9 

a 

9 

9 

a 

B 

E 

E 

E 

E 

E 

E 

E 

E 

E 

E 

I 

o 

r 

E 

E 

fl 

fl 

fl 

E 

H 

fl 

fl 

E 

fl 

fl 

9 

9 

9 

9 

9 

9 

9 

9 

9 

K 

B 

B 

A 

u 

9 

n 

9 

m 

m 

9 

9 

9 

9 

9 

9 

9 

K3 

E 

E 

E 

E 

E 

E 

E 

fl 

L 

C 

T 

i 

0 

E 

p 

fl 

B 

B 


fl 

H 

fl 

fl 

fl 

fl 

3 

01 

9 

9 

j 

9 

r D 

9 

9 

3 

m 

A 

U 

G 

9 

a 

9 

I 

5T 

9 

9 

9 

9 

a 

9 

a 

K3 

E 

B 

E 

Z 

E 

I 

E 

E 

A 

L 

C 

T 

I 

fl 


fl 

T 

B 

fl 

fl 

fl 

H 

fl 

fl 

fl 

m 

9 

9 

9 


9 

C 

9 

9 

H 

9 

9 

0 

H 

9 

9 

m 


9 

a 

9 

9 

9 

9 

9 

9 

El 

E 

E 

E 

D 

E 

El 

E 

E 

P 

E 

E 

c 

T 

E 

fl 

H 

P 

E 

ill 

fl 

fl 

fl 

fl 

fl 

fl 

m 

01 

9 

9 

9 

9 

9 

9 

9 

A 

9 

9 

r -1 

9 

9 

m 

9 

S'" 

9 

9 


9 

a 

9 

9 

9 

El 

E 

E 

fl 

E 

E 

E 

E 

E 

N 

E 

E 

L 

E 

E 

B 

H 

II 

fl 

E 

B 

fl 

fl 

fl 

fl 

fl 

a 

M 

9 

9 

c 

9 

9 

9 

a 

U 

9 

9 

9 

9 


m 

9 

E'" 

u 

9 

Y 

9 

9 

9 

9 

9 

E 

H 

E 

E 

V 

E 

E 

E 

E 

E 

E 

E 

E 

E 

c 


E 

0 

j 

fl 

Y 

fl 

fl 

19 

fl 

19 

EH 

01 

9 

9 

K 

9 

9 

9 

9 

9 

9 

9 

9 

m 

T 

9 

9 

9 

z 

« 

X 

Oi 

m 

m 

Oi 

m 

D 

E 

E 

E 

U 

E 

E 

fl 

E 

E 

E 

E 

H 

H 

L 

19 

fl 

E 

0 

M 

P 

r 

B 

& 

y 


m 

01 

9 

9 

Is 

9 

9 

9 

9 

9 

9 

9 

n 

9 

9 

9 

9 

9 

Q| 

m 

9 

9 

9 

9 

a 

a 

E 

B 

B 

H 


E 

D 

E 

E 

E 

E 

E 

E 

H 

H 

B 

fl 

m 

H 

0 

E 

fl 

E 

fl 

fl 

E 

a 

m 

9 

9 

9 

9 

9 

m 

9 

9 

9 

9 

9 

9 

m 

m 

9 

m 

9 

9 

9 

9 

9 

a 

9 

9 

■3 

E 

B 

E 

E 

E 

E 


fl 

E 

E 

E 

E 

E 

R 

A 

fl 

c 

fl 

E 

fl 

fl 

fl 

MB 

fl 

fl 

Ql 


9 

9 

9 

9 

9 

H 

9 

9 

m 

m 

9 

m 

W 


m 

X' 

m 

9 

a 

9 

a 

9 

9 

a 

E 

E 

E 

E 

E 

E 

E 

El 

E 

E 

X 

z 

E 

E 

IT 

R 

A 

L 

c 

E 

i 

fl 

■SI 

E 

E 

fl 


01 

9 

9 

9 

9 

9 

9 

9 

m 

T 

3 

9 

If 

9 

m 

x 

9 

q 

9 

u 

a 

9 

9 

a 

a 

EH 

E 

E 

E 

E 

E 

E 

E 

E 

n 

UL 

Lx 

E 

a 

E 

E 

R 

B 

L 

fl 

T 

H 

fl 

H 

fl 

B 


Figure 4 q. 


RESTRICTE D 


157 





JlEQTRICTE E r 


REF ID:A56892 


(Showing only n partially flllod tablo) 


Final Letter ( 0 2 o ) 



A 

B 

C 

D 

E 

F 

G 

H 

I 

J 

K 

A 

FX 

CH 

XE 

YY 

ZA 

YG 

FB 

CD 

EF 

XJ 

ZX 

B 

NY 

DC 

NB 

ZI 

XX 

DX 






C 




AH 




AB 




_ D 

^ ft 

2. E 

AX 

— 

BB 

— 

YA 

AI 



— 

AY 

— 

<0 _ 
-4J F 


AG 



NZ 



AZ 




'a 

1—< 

1 






1 



1 


£ N 


BC 


CY 



i 


|_ 

! 






1 








X 





AC 





AJ 


Y 

DE 


| 




Ar 



1 


Z 

AE 

J 







BD 

1 

■ 


A 

B 

C 

D 

E 

F 

G 

H 

I 

J 

k' 


X 

_Y 

Z 

EA 

DJ 

FH 


ND 

— 

BF 



— 

— 


AA 

1 

1 


mm "1 

ba! 

FE 

-1 

—1 

—I 


BE 




AD 


AK 




A 

B 

C 

D 

E 

F 

N 

X 

Y 

Z 


X Y Z 


Figure 50 * 

reciprocal in nature, that is AFp=YG c and YGp=AF c . Thus, this single 
table serves for deciphering as well as for enciphering. Reciprocity 
is, however, not an essential factor, in fact, greater security is pro¬ 
vided by non-reciprocal tables. But, in the case of such non-reciprocal 
randomly constructed tables, each enciphering table must have its comple 
mentary deciphering table. 

e. Digraphic tables employing numerical equivalents instead of 
letter equivalents may be encountered. However, since 6 j 6 equivalents 
are required (there being 676, or 26x26, different pairs of letters), 
this means that combinations of three figures must be used, such systems 
are termed trinome-digraphic systems, indicating clearly the number of 
elements which comprise the cipher units. By way of an example, the 
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Q 

following figure contains a fragment of a table 0 which provides trinome 
equivalents for the plaintext digraphs: 


A 

B 

C 


ABC P E 
001 002 003 004 005 
027 028 029 030 031 
053 054 


Y 625 626 
z 651652 


_Y_Z 

025 026 
051 052 


649 650 
675 676 


Figure 51. 


f« All of the foregoing tables have been digraphic in nature, but 
a kind of false trigraphic substitution may also be accomplished by means 
of similar tables, as Illustrated in Figure 52 , wherein 0 the table is the 
same as that in Figure 49 with the addition of one more sequence at the 
top of the table. In using this table, 0 ^ is located in sequence I, and 

ir 


It is interesting to note that this comparatively bulky and unwieldy 
table can be reduced to the following two alphabets with numerical 
equivalents for the letters: 

(1) ABCDEF.XYZ 

000 026 052 078 io 4 130 . 5980624 650 

( 2 ) A B C D E F.XYZ 

1 2 3 4 5 6 . 24 25 26 

In enciphering, the first letter of the plaintext digraph is converted 
into its numerical value from alphabet (l), and the second plaintext 
letter is converted by means of alphabet (2), the two numerical values 
thus derived are added together, and their sum is taken as the cipher 
equivalent of the particular plaintext digraph. Of course, this simple 
reduction would not be possible if the trinomes, in ascending order, had 
been arranged In the table in, say, a diagonal manner. 
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Figure 52 . 



lying at the intersection, of the row indicated by G-* in sequence I and 


the column determined by 9 p, Thus, FERE LMES would be enciphered KNZ 

IEQ KOV. Various other agreements may be made "with respect to the 
alphabets in which each plaintext letter will be sought in such a table, 
but the basic cryptographic principles are the same as in the case 
described. 

g. Tables such as those illustrated in Figs. 47 - 52 , above, have 
been encountered in operational systems, but their use has not been vary 
widespread because of their relatively large size and the inconvenience 
in their production and handling. In lieu of these large tables it is 
possible to employ much smaller matrices or geometrical designs to ac¬ 
complish digraphic substitution, methods involving their use will be 
discussed in the following paragraph. 
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66 Polygicphic substitution methods employing oinal] matrices . 9 — 
a. A simple method for accomplishing digiaphlc substitution in¬ 
volves the use of the four-square matrix , o matrix consisting of four 
5 r5 squares in which the letters of a 25-element alphabet (combining X 
and J) are inserted in any prearranged order. When four such squares 
arc arranged in o matrix as shown in Figure 53 > the latter may he em¬ 
ployed for digraphxc substitution to yield the same cipher results as 

does a much larger table of the type treated in the preceding paragraph. 

1 _ o 

In a four-square matrix, 0_ of 00 -, is sought in section 1 , 0 , in section 
12 & i P 

2 . Thus, 0 p and ©p will always form the northwest-southeast corners of 

an imaginary rectangle delimited by these two letters as located m 

1 2 

those two sections of the square. Then 0 C and 0 C are, respectively, the 
letters at the northeast-southwest corners of this' same rectangle. Thus, 
TG_=XS„, WD =CH OR =W _; UR = 3 ® : etc. In decrypting, 0 ^ and 0 ^ are 

sought in sections 3 and h } respectively, and their equivalents, and 

2 * 

©p, noted in sections 1 end 2, respectively. 
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Figure 53 . 

b. It is possible to effect digraphic substitution with a matrix 
consisting of but two sections by a modification m the method of find¬ 
ing equivalents. In a horizontal two-square matrix, such as that shown 

,1 _ o 

in Figure 5 ^> ©p of © 0 p is located in the square at the left, 0p, in the 

square at the right. 


g 

The word matrix as emp]oyed in this paragraph refers to checker¬ 
board-type diagiams smaller than the tables illustrated in the preceding 
paragraph. These matrices are usually composed of sections containing 

25 cells each. 
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Figuie 5U 

* f - 1 

12 

When 0p and 9p are at the opposite ends of the diagonal of an imaginary 


rectangle defined "by these betters, the ciphertext equivalent comprises 
the two letters appearing at the opposite ends of the other diagonal of 

^ ^ r* f, 

the same rectangle, 0", is the particular one -which is in the same row 03 

12 p — „ 

@p, and © c is the one in the same row as ©p. For example, ALp=TT c , 

DOpSGAQ. When ©p and 9p happen to he m the same row, the ciphertext 

_ * } 

equivalent is merely the reverse of the plaintext digraph, for example, 
ATpfTAc and EB^=HE C . - ' • - 

c. Digrephic substitution may also he effected hy means of v ertical 
two-square matrices , in which one section is directly above the other, as 
in Figure 55* it will he noted that matiicesjof this type have a feature 
of leciprocity when employed according to "the usual rules, which follow. 
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When 6 p and 9 p are at the opposite ends of a diagonal, the rule for en¬ 
cipherment is the same as that for horizontal two-square encipherment 
(e.g., MDpzUA,, and UAp=MO c ); when hoth 0 p and 8 p happen to he in the same 
column, the plaintext digraphs are self-enciphered, (e.g., MAp=MA^, and 
ELpSELg), a fact which constitutes an important weakness of this method.^ 
(This disadvantage is only slightly less obvious in the preceding case of 

horizontal two-square methods wherein the cipher equivalent of 80 p con¬ 
sists merely of the plaintext letters in reversed order. 

d. One -square digraphic methods, with a necessary modification of 
the method for finding equivalents, are also possible. The first of 
this type to appear as a practical military system was that known as the 
11 

Playfair cipher . It was used for a number of years as a field cipher 
* by the British Army, before and during World War I, and for a short time, 
also during that war, by certain units of the American Expeditionary 
Forces. Figure 5 6 shows a typical Playfair square. The modification in 
the method of finding cipher equivalents has been found useful in 
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Figure 56« 


imparting a greater degree of security than that afforded in the preced¬ 
ing small matrix methods. The usual method of encipherment can be best 
explained by examples given under four categories: 


10 See subpar, 73 b on other enciphering conventions which remove this 
weakness. 

This cipher was really invented by Sir Charles Wheatstone but re¬ 
ceives its name from Lord Playfair, who apparently was its sponsor before 
the British Foreign Office. See Wemyss Reid, Memoirs of Lyon Playfair , 
London, 1899. It is of interest to note that, to students of electrical 
engineering, Wheatstone is generally not known for his contributions to 
cryptography but is famed for something he did not invent—the so-called 
Wheatstone bridge", really invented by Samuel H. Christie. 
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(1) Members of the plaintext pair, 0 p and 0 ^, are at opposite ends 

of the diagonal of an imaginary rectangle defined "by the two letters, 

1 2 

the members of the ciphertext pair, and 9£, are at the opposite ends 
of the other diagonal of this imaginary rectangle. Examples* MDp-AL c , 
M£p=UC c , LUpSQMc, VI p =TC c . 

(2) 0^ and @p are in the same row, the letter immediately to the 
right of 0p forms 0£, the letter immediately bo the right of 0£ forms 
0 g. When either 9 p or 0 p is at the extreme right of the row, the first 
letter in the row becomes its cipher equivalent. Examples* M/\p=M c , 
Hip=AF c ; AFpSMMcJ FAp=M c . 

(3) ej and e| are in the same column, the letter immediately below 
0 p foxms ©q, the letter immediately below 9 p forms 0 ^. When either © p 
or 0f" is at the bottom of the column, the top letter in that column he- 

Jr 

comes its cipher equivalent. Examples MC p =CB c , AWp=TA c , WA p =AT c , 

( 4 ) 0p and 0p are identical, they are to he separated by inserting 
a null, usually the letter X or Q, and subsequently enciphered by the 
pertinent rule from above. For example, the vord BATTLES would he en- 
ciplieied thus* 

BA TX TL ES 
DM EW CO HP 

The Playfair square is automatically reciprocal so far as encipherments 
of type (l) above are concerned, hut this is not true of encipherments 
of type (2) and (3). 

e. It is not essential that the small matrices used lor digrophic 
substitution be in the shape of perfect squares, rectangular designs will 

12 

serve equally well, with little or no modification in procedure. For 
example, each section of, say, a four-square matrix could he constructed 
with four rows containing six letters each by having Up serve for V p , as 

well as Tp for J p . Furthermore, it is possible to expand the sections of 

a digraphic matrix to 28, 30, 01 more characters by the following subter¬ 
fuge, without introducing digits or symbols into the cipher text .-*-3 One 


Ilowcver, because the terms “four- squore matrix", "two -square matrix", 
and "Playfair square " have become firmly fi v ed in cryptologic literature 
and practice, they continue to be applied xo all such matrices, even 
when the "squares" of such matrices do not contain on equal nurabci of 
row 3 and columns (that is, even when +hey ore not square). 

^3 The addition of any symbols such as the digits 1, 2, 3,*.* i n oo a 
metric solely to augment the number of elements to 27, 28, 30, 32, or 36 
characters would not he considered piacticoble, since such a p_oceJmo 
would result in pioducmg cryptograms containing inter’'ixtuj.es oi Icxtci-o 
°nd figures. 
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of the letters of the alphabet may be omitted from the set of 26 letters, 
and this letter may then be replaced by 2, 3, or more pairs of letters, 
each pair having as one of its members the omilted singDe letter. The 
5x6 Playfair square of Figure 57a has been derived thus, the letter K has 
been omitted as a single letter, and the number of characters in the 
rectangle has been made a total of 30 by the addition of five combi¬ 
nations of K vith other letters. An interesting consequence of this 
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Figure 57a. 


modification is that certain irregularities are introduced in any crypto¬ 
gram produced through its use, for example, (l) occasionally a plaintext 
digraph is replaced by ciphertext trigraph or tetragraph, such as 


and (2) variant values may appear—BKE C , DKE 


AMp =HKU c and EPp=KEK0 c ; 

KEP C , GP C , and TP C all may be used to represent CKp, 


c , 

As far as the 


deciphering is concerned, there is no difficulty because any K occurring 
in the cipher text is considered as invariably forming a ligature vith 
the succeeding letter, taking the pair of letters as a unit; and, vhen a 
plaintext unit is obtained containing on e of the K-pairs, the letter 
after the K is disregarded, for example, CKOp is read as CK. The four¬ 


square matrix in Fig. 57b has also been constructed using the foregoing 
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subterfuge. With this latter matrix, numbers in the plain text may he 
enciphered, still without producmg cipher text containing numbers, for 
example, the plain text "HILL 3^06" would be represented by the cipher 
QAB AT KUKI NQE, which would be regrouped into groups of five letters 
and sent as QABAT KUKIN QE... 



f. Figure 58 shows a numerical four-square matrix which presents a 
rather interesting feature in that it makes possible the substitution of 
3-figure combinations for digraphs in a unique manner. To encipher a 
message one proceeds as usual to find the numeiical equivalents of a pair, 
and then these numbers are added together Thus 
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Figure 58* 


In deciphering, the greatest multiple of 25 contained in the group of 
three digits is determined, then this multiple and its remainder are used 
to form the elements for determining the plaintext pair in the usual 
manner. Thus, 284=275t 9 = ER* 

jg. Thus far all the small-matrix methods have involved only dl- 
graphic substitution. The two matrices together illustrated in Figures 
59a and b may be used to provide a system for encipherment which is 
partly trigraphic, the adverb "partly" has been used because this par- 
ticulai system will yield trlgraphic encipherment approximately 88.5$ of 
the time in ordinary text and digraphic encipherment appioxlmotely 11.5$ 
of the time. 11 *' In this case the cipher equivalents of the trigraphs 


These figures are based on the number of trigraphs ending in one of 
the 15 highest-fiequency letters (ETHROAISDLHCFHJ), and on the nunber of 
tiigiaphs ending with other letters. 


RESTRICTED 



166 










REF ID:A56892 


RESTRICTE D 


1 

2 

3 

4 


Fig. 5S[b. 


(or digraphs, as the case may he) are tetranomes. Encipherment is test 
illustrated by an example, this is given in the next subparagraph. 

h. Let the text to be enciphered be a message beginning with the 
words "REFERRING TO TOUR MESSAGE HUMBER FIVE STOP ..." This is rewritten 
into trigraphs, with the proviso that the third letter of the trigraph be 
one of the letters contained In the small square in Fig. 5Sb, if the 
third letter is not one of these 15 letters, the plaintext grouping is 
left as a digraph, then the grouping into trigraphs (or digraphs) con¬ 
tinues • Thus, the foregoing plain text would be written as follows * 

KEF ERR IK- GTO YOU RME SSA GEN UM- BER FI- VES TOP ... 

In encipherment, it is to be noticed that Rp occurs four times in section 
1 (as do all the letters) and Ep occurs four times in section 2; the 
proper combination of the l6 possibilities is determined by the coordi ¬ 
nates of the third letter of the trigraph as indicated in the small 
square, Fig. 59b. Since the coordinates of Fp in this square are 42, 
then It is the 4th occurrence of Rp in section 1 and the 2d occurrence 

of Ep in section 2 which are used to obtain the equivalent for the tri¬ 
graph REF.,,, this equivalent is 1905* When the plaintext unit as obtain- 

— n 

ed above is only a digraph, it is the 1st occurrence of 0:: whoch is used 

2 * 

in section 1 and the 1st occurrence of 0p which is used in section 2; 

thus, "IN-" from the sample message beginning, above, would be enciphered 
2828. The encipherment of the plaintext example above is then 

REF ERR IN- GTO YOU RME SSA GEN UM- BER FI- VES TOP 

1905 4o8l 2828 4719 0727 1372 7417 4n8 2270 3807 4024 8806 8623 
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The cipher text could then "be transmitted in groups of four digits, or, 
as a subterfuge to conceal the basic group length, the transmission 
could be in five-digit groups. In decipherment, the ciphertext tetra- 
nome is deciphered in the manner of the usual four-square matrix, and 

the location of the particular values for 0p and 

identity of the third plaintext letter, if any. 

jL. Nov that the student has become familiar vith the details of 
typical polygraphic substitution systems, he is ready to continue his 
cryptanalytic s tudy with the treatment of methods for recognizing poly- 
giaphlc substitution, these methods are described in the next paragraph. 

67* Methods for recognizing polygraphic substitution .—a. The 
methods used to determine whether a given cryptogram is digraphic in 
character are usually rather simple. If there are many repetitions in 
a cryptogram or a set of cryptograms and vet the uniliteral frequency 
distubution gives no cleai-eut Indications of monoalphabeticity, if 
most of the repetitions contain an even number of letters and these 
repetitions for the most part begin on he odd letters and end on the 
even letters of the message, yet the cipher text does not yield to so¬ 
lution as a biliteral cipher when the procedures outlined in Sections 
VII and VIII are applied to it, if the cryptograms usually contain an 
even number of letters (exclusive of nulls), and if the ciphei text is 
In letters and all 26 letters are not piesent and J or U are among the 
absent letters (or if the cipher is in digits and there is a limitation 
in the range of the text when divided into trmomes, this range usually 
being not greater than OOI-676), then the encipherment may be assumed to 
be digraphic In nature. 

b. Although the foregoing general remarks arc true as far as they 
go, occasionally they may be difficult to apply with any clcar-cut 
icsults unless a large volume of cipher text is available for study. To 
supplement them there are statistical tests which may be applied foi the 
recognition of digraphic substitution. Just as the <j> test and the A 
test may be app] led to the uniliteral distribution of ja ciyptogrom to 
help determine ’i/hether it is monoalphabetic with respect to single¬ 
letter plaintext units, so may these same, tests bo applied to the di- 
cranhic distribution of a cryptogram for the purpose of deteimining 
whether the cryptogiom in question is monoalphabetic when considered os 
a ddgraphic cipher. _ _ * ' 

c The basac form of the <j> test is the same when'applied to di- 
giaphic distributions as when applied to monographic—thac is, uni- 
litei al—disiributions (see par. 27) • It is only the plain and random 
constants that change, and "N" in the formulas now pertains to the number 
of digiaphs under consideration, instead of the number of single letters. 

■ 1 1 rl ~1 ^ •** m - 
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To illustrate this, the formulas for computing the ”digraphic phi plain 
(<{> 2 )" and the "digraphic phi random (<|> 2 )" are Bhown below* 

Jr * 

^ = .0069 N(N-l) 

$ 2 : .0015 N(N-l) i 

The "digraphic phi observed (<f>^)" is calculated in the usual manner, that 
is, by multiplying each f (which in this case is found in one of the 
cells of a digraphic distribution) by f-1 , and then totalling all the 
values thus derived. 

t 

d. The A 2 test (or the "digraphic blank-expectation test") may be 
applied to a digraphic distribution just as easily as its monographic 
counterpart is applied to a uniliteral frequency distribution. For this 
purpose. Chart 8 is given below, showing the average number of blanks 
theoretically expected in cligraphic distributions for plain text and for 
random text containing various numbers of digraphs (up to 200 digraphs). 
As can be seen, the chart contains two curves. The one labeled P applies 
to the average number of blanks theoretically expected in digraphic 
distributions based upon normal plaintext messages containing the indi¬ 
cated number of digraphs. The other curve, labeled R, applies to the 
average number of blanks theoretically expected in digraphic distri- 


15 - - _ ^ 

The digraphic plain constant, .OO 69 , was obtained by summing the 

squares of the probabilities of digraphs in English plain text; the 

digraphic random constant, . 0015 , is merely the decimal equivalent of 

1 / 676 . Further elaboration on the use of these constants, among others, 

will be given in Military Cryptanalysis, Part II. 
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Chart 6. 

"butionB based upon perfectly random assortments of digraphs. In using 
this chart one finds the point of intersection of the vertical coordi¬ 
nate corresponding to the number of digraphs in the message, vith the 
horizontal coordinate corresponding to the observed number of blanks in 
the digraphic distribution for the message. If this point of inter¬ 
section falls closer to curve P than it does to curve R, this is evidence 
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that the cryptogram Is digraphic in nature-^; if it falls closer to 
carve R than to curve P, this is evidence that the cryptogram is not 
digraphic in choracterT 


e. Although it may not he necessary to resort to the use of the 
P P 

<{> and A, test to determine whether or not a particular cryptogram has 
teen digraphically enciphered, it is well to know the application of 
these tests, since use has “been made of them in difficult cases in 
operational practice. They may he helpfully employed in cases where the 
cryptanalyst is uncertain as to whether or not a single null has "been 
added at the beginning of a cryptogram suspected to he a digraphic 
cipher; and these tests may also he found useful in the analysis of com¬ 
plex cases where the digraphic encipherment has been applied, not to 
adjacent letters of the plaintext message, hut to digraphs composed of 
more-or-less separated letters in the message. Elaborations of these 
ideas will he treated in Military Cryptanalysis, Part II. 


f» As for the recognition of trigraphic substitution ciphers—if 
most of the repetitions are a multiple of three letters in length, if 
these repetitions for the most part begin (when the cipher text is 
divided into trigraphs) with the first letters and end with the third 
letters of the trigraphs, and if the length of the cryptograms is for 
the 1 ’ most part a multiple of three letters, yet the cipher text does not 
yield to solution as a triliteral cipher, then the encipherment may he 
assumed to he trigraphic in nature. 

g . Just as the <p test may he used as an aid in the recognition of 
digraphicity, it may theoretically he used for recognizing the tri¬ 
graphic, tetragraphic, etc., nature of cryptograms, hut its use for 
these latter purposes is much more limited because of the “large amount 
of text which would he required to permit a valid application of the 
pertinent polygraphic <j> test. 


68. General procedure in the identification and analysis of poly ¬ 


graphic substitution ciphers .— A. Certain systems which at first glance 
seem to he polygraphic, in that groupings of plaintext letters are 
treated as units, are on closer inspection seen to he only partly poly¬ 
graphic in character. Such is true of systems involving large tables of 
the type illustrated in Figs. 47a and h, and 48 (in par, 65 ,‘ above), 


— 15 - 

Unfortunately, such would also he the case if the cryptogram under 
consideration were a polyalphabetic cipher involving two alphabets. 
However, to distinguish between a digraphic cipher and a polyalphabetic 
cipher with two alphabets, a digraphic distribution could he made "off 
the cut", that is, made of those ciphertext digraphs which are formed by 
omitting the first letter of text and then dividing the remaining text 
into groups of two letters. If the system were digraphic, such a distri 

p 

hution would exhibit a poor <Fr; if the system were a two-alphabet substi 

2 ° 

tution system, the <{> 0 would be as satisfactory as that of the regular 


distribution, taken "on the cut". 
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whcaein encipherment is by pairs but one of the letters in each pair La 
enciphered monoalphabetically, malting these systems only pseudo -poly- 
graphic. For example, using the table in Figure b8, any plaintext di¬ 
graph beginning with "A” must be encipheie<3 by a ciphertext digraph 
beginning with 1 V; any plaintext digraph beginning with "B" must he 
enciphei"ed hy a ciphertext digraph beginning with "E"; etc. A crypto- 
gieii involving the use of this table may then he identified as such 
merely from a study of the uniliteral frequency distribution mode on the 
initial letters of the cipher digraphs, since such a distribution would 
perforce be monoalphabetic.17 

b. In certain other systems—namely, the four-square, two-square, 
and Playfair squaie systems of par. 66, above—the method of encipherment 
is by pairs, but the encipherments of the left-hand and right-hand 
members of the pairs show group relationships} this is not pseudo¬ 
polygraphic but, rather, partially -polygraphic. Cryptograms enciphered 
by means of systems of this latter type may not be readily identified as 
such merely througn an examination of their cipher text, but their so¬ 
lution may be effected rather rapidly as soon as a few correct plaintext 
assumptions have been made therein. A more detailed treatment of this 
matter will be given in succeeding paragraphs of this section. 

£. The analysis of cryptograms which have been produced by di¬ 
graphic substitution is accomplished largely by the application of the 

*| Q 

simple principles of frequency of digraphs, with the additional aid of 
digraphic idiomorphs and such special circumstances as may be known to 
or suspected by the cryptanalyst. The latter refer to peculiarities 
which may be the result of the particular method employed in obtaining 
the equivalents of the plaintext digraphs in the encrypting process, 
such as those mentioned in subpars. a and b, above. In general, if there 
is sufficient text to disclose the normal phenomena of repetition and 
idiomorphism, or if cribs are available to be used as an entering wedge, 
solution will be feasible. The foregoing general statements will be 
expanded upon In the following two subparagraphs, d and e. 

d. When a digraphic system is employed in regular service, there 
is little doubt that traffic wil1 rapidly accumulate to an amount more 
than sufficient to permit of solution by simple principles of frequency. 
Sometimes only two or three long messages, or a half-dozen of average 
length, are sufficient. For with the identification of only a few cipher 


37 

For this purpose, the simplest and most economical way to obtain 
the unilitcral distributions for the initial and final letters of di¬ 
graphs is to make a digraphic distribution and then add the tallies in 
each row to yield the distribution for the initial letters, and add the 
tallies in each column to obtain the distribution for the final letters. 

18 

In this connection, it would be well for the student to familiarise 
himself with that poition of Appendix 2 which contains digraphic fre¬ 
quency data. 


RESTRICTE D 


172 


REF ID:A56892 


RESTRICTE D 


digraphs, larger portions of messages may "be read because the; s3''cloton3 
of words formed from the few high-frequency digrapha very definitely 
limit the values that can "be inserted for the intervening unidentified 
digraphs. For example, suppose that the plaintext digraphs EE, Id, OH, 
HD, NO, SI, NT, and TO are among those that have "been identified "by 
frequency considerations, corroborated by a tentatively identified long 
repetition; and suppose also that the enemy is known to be using a large 
table of Gj6 cells containing digraphs showing reciprocal equivalence 
between plaintext and ciphertext digraphs. Suppose the message begins 
as follows (in which the assumed values have been inserted). 


XQ, 

VO 

ZI 

LK 

AP 

OL 

ZX 

PV CK 

IK OL 

UK AT 

HN LK 



ND 

IN 


NT 


RE 

NT 

NO 

IN 

VL 

BN 

OZ 

BZ 

DY 

TY 

LE 

GI 





SI 


ON 

TO 








The initial words SECOND INFANTRY REGIMENT are readily recognized. 
Furthermore, if CK c =GIp, then GI c =CKp, which suggests ATTACK as the last 

word in the message beginning. This fragment of the message may now be 
completely recovered. SECOND INFANTRY REGIMENT NOT YET IN POSITION TO 
ATTACK. 


£. Just as the choice of probable words in the solution of uni¬ 
literal systems is aided or limited by the positions of’repeated letters 
(see subpar. t9d), so, in digraphic ciphers, is the placing of cribs 
aided or limited by the positions of repeated digraphs . In this con¬ 
nection, several frequent words and phrases containing repeated digraphs 
have been tabulated for the student's aid, and this list of digraphid 
idiomorphs is presented as Section D m Appendix 3 (q.v.). Thus, if one 
is confronted by a ciphertext message containing the following repeated 

sequence (therefore likely to represent an entire word). 

„ -1 *»*_'•' » v ' r * * •> v « s t » - 'kf, & 1 

VI FW HM AZ FF FIT RO' 


he may refer to the appropriate section of Appendix 3 which will dis¬ 
close, on the basis of the idiomorphic pattern "AB — —— AB" starting 
with the second cipher digraph, that the underlying plaintext word may 
be RE EN FO RC EM EN T, among others. Once a good start has been made 
and a few words have been solved, subsequent work is quite simple and 
straightforward. A knowledge of enemy correspondence, including data 
regarding its most common words and phrases, is of as much assistance in 
breaking down digraphic systems as it is in the solution of any other 
cryptosystems. J 

i \ wc * 

f. In the case of trigraphic substitution, analyst^ is made con¬ 
siderably more complex by the large amount of traffic required, not only 
for the initial entries, but also for further e>rploitation of the enter¬ 
ing wedges. In effect, the solution of a trigraphic system closely 
parallels the solution of the syllabary portion of a large two-part code, 
these techniques will be discussed in Military Cryptanalysis, Part V. 
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69 . Analysis of four-square matrix systems .—a. In all the small- 
matrix methods illustrated In paragraph 66 , the encipherment is only- 
partial] y digraphle because there are certain relationships between 
those plaintext digraphs which have common elements and their corre¬ 
sponding ciphertext digraphs, which will also have common elements. For 
example, in the four-s quar e matrix given in Fig. 53, it will be noted 
that AAp=FT c , AFp=FO c , ALprFMj,, AQp=FL c , and AVpcFK^,. In each of these 

cases when Ap is the initial letter of the plaintext pair, the initial 

letter of the ciphertext equivalent is F c . This, of course, is the 

direct lesult of the method, it means that the encipherment is mono- 
alphabetic for the first half of each of these five plaintext pairs. 

This relationship holds true for four other groups of five pairs begin¬ 
ning with Ap, in effect, there are five cipher alphabets employed, not 

25* Thus, this case differs from the case discussed under subpar. 68 a 
only in that the monoalphabeticity is complete, not for half of all the 
pairs but only among the members of certain groups of pairs . In a true 
digraphic system, such as a system making use of a 676 -cell randomized 
table, relationships of the foregoing type are entirely absent, and for 
this reason such a system is cryptographically more secure than small- 
matrix systems. r - - - 

h. From the foregoing it is clear that when solution has progressed 
sufficiently to disclose a few values, the insertion of letters within 
the cells of the matrix to give the plaintext-ciphertext relationships 
indicated by the solved values immediately leads to the disclosure of 
additional values. Thus, the solution of only a few values soon leads to 
the breakdown of the entire matrix. 

c. The following example will serve to illustrate the procedure. 

(l) Let the message he as follows. 



1 

2 

3 

< 

5 

s 

7 

8 

t 

10 

11 

12 

13 

M 

16 

10 

17 

18 

19 

20 

21 

22 

23 

24 

28 

20 

27 

28 

29 

30 

A. 

H 

F 

C 

A 

P 

G 

0 

Q 

1 

L 

B_ 

s_ 

P_ 

K 

M 

N 

D 

u 

K 

E 

0 

H 

Q 

N 

F 

B 

0 

R 

U 

N 

B. 

Q 

c 

L 

C 

H 

Q 

B 

Q 

B_ 

F 

H 

m 

A 

F 

X 

S 

I 

0 

K 

0 

Q 

Y 

F 

N 

s 

X 

M 

C 

G 

Y 

C 

X_ 

1_ 

F 

B 

E 

_X 

A 

F 

D 

X 

L 

P 

M 

X 

H 

H 

R 

G 

K 

G 

Q 

K 

g 

M 

L 

F 

E 

A 

A 

A 

D. 


JO 

_I 

H 

M 

U 

E 

0 

R 

D 

C 

L 

T 

U 

F_ 

E 

A 

J 

C 

G 

Q 

N 

H 

F 

X_ 

I 

k 

F 

_B_ 

JL 

X 

-r 

E. 

F 

L 

B 

U 

Q 

F 

C 

H 

Q 

0 

Q 

M 

A 

F 

T 

X 

s 

Y 

C 

B 

E 

P 

F 

N 

B_ 

S 

P 

J£ 

N 

U 

F. 

Q 

I 

T 

X 

E 

U 

g 

M 

_L. 

F 

E 

A 

A 

JL 

G 

0 

_i 

E 

U 

E 

H 

P 

I 

A 

N 

Y 

jr 

F 

L 

_B 

G. 

F 

E 

E 

P 

I 

D 

H 

P 

C 

G 

n 

Q 

I 

H 

B_ 

F 

H 

JM 

H 

F 

X 

C 

K 

U 

P 

D 

G 

Q 

P 

N J 

H. 

C 

B 

C 

Q 

L 

Q 

P 

.N 

F 

F 


N 

I 

T 

0 

R 

T 

E 

N 

C 

0 

B 

C 

N. 

T 

F 

H_ 

H 


Y > 

J. 


JL 

A 

c 

I 

A 

A 

I 

A 

U 

c 

Ji 

jr 

J? 

C 

B 

I 

F 

G 

w. 

_ K 

F 

C 

Q 

S 

L 

Q 

M 

C 

B 

K. 

0 

Y 

c 

R 

Q 

Q 

D 

"P 

R 

X 

F 

n" 

0, 

J. 

L 

F 

I 

D 

G 

c 

C 

G 

I 

y 

0 

*G_ 

0 

I 

JH 

H 

F 

L. 

I 

R 

c 

G 

G 

G 

N 

D 

L 

N 

0 

Z 

T 

F 

G 

E 

E 

R 

R 

p 

I 

F 

H 

0 

T_ 

F 

H 

H 

_A. 

Y~ 

M. 

z 

_L 

JL 

C 

I 

A 

A 

J 

J9L 

U 

C 


JL 

_P 










"L 






** 
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(2) The cipher having "been tested for standard alphabets (by the 
method of completing the plain-component sequence) and found to give 
negative results, a uniliteral frequency distribution is made. It is as 
follows: 



ABCDEFGHlJKLMNOPQR 

H 15 20 8 18 30 17 22 24 0 8 14 ~U 18 ' 16 18 33 9 



STUVWXYZ 

« 11 11 ol 12 7 3 


(3) At first glance this may appear to the untrained eye to be a 
monoalphabetic frequency distribution, but upon closer inspection it is 
noted that, aside from the frequencies of four or five letters, the 
frequencies for the remaining letters are not very dissimilar. There 
are, in reality, no very marked crests and troughs —certainly not as many 
as would be expected in a monoalphabetic substitution cipher of equal 
length. The <(> test, if taken (this test, as a rule, is not necessary 
with samples of text of sizes such as this), would show unsatisfactory 
results ($ o =6084, as against 4^=7870 and 4 r s ^5^3)* 

(4) The message is carefully examined for repetitions of 4 or more 
letters, and all of them are listed* 


Frequency Located in lines 


TFHHAYZLQCIAAIQUCBTP (20 letters). 2 H and L. 

QMLFEQQIGOI (11 letters). 2 C and F. 

XIFBEX (6 letters). 2 C and D. 

FEQQ . 3 C, D, F. 

QMLF . 3 C, F, K. 

BFHM. 2 B and G. 

BSPK. 2 A and E. 

GOIH... 2 D and K. 


Since there are quite a few repetitions, two of considerable length, 
since all but one of them contain an even number of letters, since these 
repetitions with but two exceptions begin on odd letters and end on even 
letters, and since the message also contains an even number of letters 
(344), the cryptogram is retranscribed into 2-letter groups for further 
study. It is as follows: * 
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Message transcribed in pairs 



1 

2 

3 

4 

3 

t 

7 

8 

0 

10 

11 

12 

13 

14 

lfi 

A 

HF 

CA 

PG 

OQ 

IL 

B5 

PK 

MN 

DU 

HE 

OH 

QN' 

FB 

OR 

UN 

B 

QC 

LC 

HQ 

BQ 

BF 

HM 

AF 

XS 

10 

K0 

QY 

FN 

SX 

MC 

GY 

C 

XI 

FB 

EX 

AF 

DX 

LP 

MX 

HH 

RG 

KG 

QK 

QM 

LF 

EQ 

QI„ 

D 

r G0 

IH 

MU 

EO 

RD 

CL 

TU 

FE 

QQ. 

CG 

QN 

HF 

XI 

FB 

EX 

E 

FL 

BU 

QF 

CH 

QO 

QM 

AF 

TX 

SY 

CB 

EP 

FN 

BS 

PK 

NU 

F 

QI 

TX 

EU 

QM 

LF_ 

EQ 

_QI_ 

GO 

.IE 

UE 

HP 

IA 

NY 

TF 

LB 

G 

FE 

EP 

ID 

HP 

CG 

NQ 

IH 

BF 

HM 

HF 

XC 

KU 

PD 

GQ 

PN 

H 

CB 

CQ 

LQ 

PN 

FN 

PN 

IT 

OR 

TE 

NC 

CB 

CN 

TF 

HH 


J 

4 zl 

QC 

IA 

AI 

QU 

CH 

_TP 

CB 

IF 

GW 

KF 

CQ 

SL 

QM 

CE 

K 

OY 

CR 

QQ 

DP 

RX 

FN 

QM_ 

LF 

ID 

GC 

CG 

10 

GO 

IH 

HF‘ 

L 

IR 

CG 

GG 

ND 

LN 

OZ 

TF 

GE 

ER 

RP 

IF 

HO 

TF 

HH 

-AI* 

M 

4 zl 

_ec 

IA 

AI 

QU 

CH 

TP 




* 






It is noted that all the repetitions listed above break up properly into 
digraphs except in one case, viz., FEQQ in lines C, D, and F. This 
latter seems rather strange, and at first thought one might suppose that 
a letter was dropped out or was added in the vicinity of the FEQQ in 
line D. But it may be assumed that the FE QQ in line D has no relation 
at all to the .F EQ Q. in lines C and F and is merely an accidental repe¬ 
tition. 

(5) A digraphic distribution is made as follows* 
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PuEQTmtQTE & 


ABCDEFGHIJKLHHOPQRg!jyVWXYZ 


A 

H 

■ 

■ 

■ 

■ 


B 

■ 

9 

I 

■ 

9 


a 

■ 

a 

a 

■ 









B 

H 

■ 

i 

■ 


a 

B 

■ 


B 


■ 


a 

a 

1 

a 

a 

*r 








C 

9 


■ 



■ 

1 

i 



B 

9 


9 

B 

a 

a 

a 









D 

H 

9 

■ 

■ 

■ 

g 

i 

i 

■ 

1 

1 



a 

i 

a 

a 

a. 









E 


■ 


a 

■ 


■ 

B 

■ 





a 

a 

a 

a 







0 



F 

9 

a 

i 

■ 

a 


a 

i 

■ 






a 

9 

a 

a 









G 

| 

■ 

1 

a 

■ 

i 



■ 





9 

a 












E 

■ 

■ 

i 

■ 

a 

a 


m 





3 

a 

a 

=* 











1 

a 

■ 

■ 

a 

1 

3 


S 






a 

a 












j 

m 

■ 


9 

■ 




* 




a 

a 

a 

a 










L 

K 

■ 



■ 















a 

a 

a 



a 

a 

a 

£ 

■ 

SL. _ 

£* _ 

■ 


s 








m 







a 






M 



«_ 

■ 







■ 

a 


m 







a 






If 




a 

■ 



■ 

■ 


■ 

■ 

a 

B 







*4 




a 


0 



■ 

■ 

■ 



a 

■ 


■ 

B 

a 

i 



a 

a 

a 

a 

a 




a 

■ 

p 



■ 

a 

■ 



a 

■ 


a 

B 

a 

9 



a 

a 

a 

a 

a 




a 

■ 

Q 



a 

a 

■ 






a 

a 

m 

a 



a 

a 

a 

a 

a 

a 



a 


R 



■ 

i 

■ 



■ 

■ 


■ 

a 

a 

B 



a 

a 

a 

a 

a 

a 



a 

■9 

S I 



■ 

■ 

i 



■ 

B 


■ 

a 

a 

a 



a 

a 

a 

a 

a 

a 



■ 

^9 

T 






* 


n 

i 







j* 








at 



U 



























V 



























w 




















1 



\ 




X 




















) 






> 

Y 




















_ m ; 


r 





Z 












c 








* 








4 

9 

8 

5 

7 

20 

8 

10 

7 


3 

6 

8 

13 

9 

9 

« 

12 

5 

r-v- 

3 1 

■u- 

9 

- 

1 

8 

7 

1 


7 

6 

18 

3 

9 

10 

9 

13 

l€ 

5 

6 

4 

5 

6 
7 

22 

4 

3 

10 
2 


4 

2 


Figure 60. 


(6) The appearance of the foregoing distribution for this message 
is quite characteristic of that for a digraphic substitution cipher. 
Although there are 676 possible digraphs, only 107 are present in bhe 
distribution, this parallels what is expected of normal plain text, since 
out of the 676 possible two-letter combinations (including "impossible 
plaintext digraphs" such as QQ, JK, etc., which might have been used for 
special indicators, punctuation marks, etc.) only about 300 are usually 
used m the construction of plain text. 1 ? The number of blank cells. 


!9 The 300 most frequent digraphs comprise 95$ of normal English plain 
text (Appendix 2, Table 7-A). 
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569 , closely approximates the 5&5 which would he expected in a distri¬ 
bution made on a sample of plain text of this size, as shown hy Chart 8. 
Furthermore, although there are many cases in which a digraph appears 
only once, there are quite a few in which a digraph appears two or three 
times, four cases in which a digraph appears four times, one case in 
which a digraph appears five times, and one in which a digraph appears 
six times. All of the foregoing observations concerning the distribution 
are reflected hy the 4 test: the observed digraphic phi value, 210, 
-compares very favorably with the expected plain value (=,0069 x 172 x 
171 = 203) as against the expected random value (=.0015 r YJ2 x 171 “ 

44), Thus all indications point to a digraphic substitution system. 

( 7 ) Since neither the <{> 0 078o) and A 0 (4) for the^initial letters 

of the cipher digraphs nor the <j> 0 (l496) and A 0 (2) for the final - 

letters are too satisfactory in their approximation to the values ex¬ 
pected for mcnoalphabetlc distt ihutions (^^1962 and 4r = 1133 > A p=5 and 

A r =0), the possibility of a pseudo -digraphlc system is^jruled out. There 

remain "the possibilities of a partlally -digraphic system employing a 
small matrix, or a true digraphic system employing a large, randomized 
table. In one common type of small-matrix system, the Playfair cipher, 
'one of the telltale indications besides the,absence t of (usually) the 
letter J is the absence of cipher doublets, that is, two successive 
identical cipher letters. The occurrence of the,double letters GG, HK, 
and QQ in the message under investigation eliminates the possibility of 
its being a normal Playfair cipher. For want of more accurate diagnostic 

pfl pi j * 

criteria at this stage , the simplest thing to assume, from among the 
various hypotheses that remain to be considered, is that, a four-square 
matrix is Involved, One yith norpa] alphabets (as being the simplest 
case) in Sections 1 and 2,is therefore set-down (Figure 6la). , 


20 > . i 1 r , } 1 1 * , i 

Even a medical practitioner often cannot successfully diagnose a 
condition on the first visit. Cryptanalyfcieally speaking, we are still 
on our "first visit". Subsequent probing will, we hope, reject or sub¬ 
stantiate this or that hypothesis or assumption, until the patient (the 
cipher text) is recovered v (i.e., brought hack to plain text). v ■* 

21 

However, see the treatment on the diagnosis of various types of 
digraphic systems in subpar 73^. 


-ft 
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A 

B 

c 

D 

E 






F 

G 

H 

I-J 

K 






L 

M 

N 

0 

p 






Q 

R 

S 

T 

u 






V 

W 

X 

y 

z 











A 

B 

C 

D 

E 






F 

G 

H 

ES 

K 



1 



D 



0 

P 


i 


| 


Q 

R 

S 

T 

U 






V 

W 

X 

y 

Z 


Figure 6 la. 


(8) The recurrence of the group QMLF, three times, and at intervals 
suggesting that it might be a sentence separator, leads to the assumption 
that it represents the word STOP. The letters Q, M, L, and F are there¬ 
fore inserted in the appropriate cells in Sections 3 and 4 of the diagram. 
Thus (Fig. 6lb)* 


A 

B 

C 

D 

E 






F 

G 

H 

I-J 

K 






L 

M 

N 

0 

P 





L 

Q 

R 

S 

T 

U 




Q 


V 

W 

X 

Y 

Z 











A 

B 

C 

D 

E 






F 

G 

H 

I-J 

K 




F 


L 

M 

N 

0 

P 



M 



a 

R 

S 

T 

U 






V 

W 

X 

Y 

Z 


Figure 6lb. 


These placements seem rather good from the standpoint that keyword-mixed 
sequences may have been used in these two sections. Moreover, in Section 
3 the number of cells between L and Q is just one less than enough to 
contain all the letters M to P, inclusive, this suggests that one of 
these letters, probably N or 0, is in the keyword portion of the sequence. 
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that is, near the top of Section 3» Without making a commitment in the 
matter, let us suppose that M follows L and that P precedes Q; then let 
both N and 0, for the present, be inserted in the cell between M and P. 
Thus (Fig. 6lc)• 



A 

B 

C 

D 

E 






F 

G 

H 

i-j 

K 






L 

M 

N 

0 

P 





L 

Q 

R 

S 

T 

U 

M 

n 

0 

P 

Q 


V 

W 

X 

Y 

z 











A 

B 

C 

D 

E 






F 

G 

H 

I-J 

K 




F 


L 

M 

N 

0 

P 



M 



Q 

R 

s 

T 

U 






V 

W 

X 

Y 

Z 


Figure 6lc. 


(9) How, if the placement of P in Section 3 is correct, the cipher 
equivalent of THp will be P0 C , and there should be a group of adequate 

frequency to correspond. Noting that PN^ occurs three times, it is as¬ 
sumed to represent THp and the letter N is inserted in the appropriate 

cell in Section 4. Thus (Fig. 6ld)* 
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Figure 63d. 
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(10) It Is about tune to try out these assumed values in the message. 
The proper insertions are made, with the following results. 



1 

2 
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30 
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13 
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HF 
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OQ 

IL 
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UN 
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QC 

LC 

HQ 

BQ 

BF 

HM 

AF 

XS 

10 

K0 

QY 
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SX 

MC 

GY 

C 

XI 

FB 

EX 

AF 

DX 

LP 

MX 

HH 

RG 

KG 

QK 

QM 

LF 

EQ 

QI> 
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(ll) So far no impossible combinations are in evidence. Beginning 
with group H4 in the message is se~en the following sequence. 

' PHFNPH 
T H . ,TH 

* + 

Assume it to be THAT THE. Then ATp=FN c , and the letter If is to be in¬ 
serted in row 4 column 1 of Section 4. But this is inconsistent with 
previous assumptions, since N in Section 4 has already been tentatively 
placed in row 2 column 4. Other assumptions for FN C are made* that it 
is, IS p (THIS TH...), that it is EN p (THEN TH...), but the same incon¬ 
sistency is apparent. In fact the student will see that FN C must re¬ 
present a digraph ending in F, G, H, I-J, or K, since N c is tentatively 
located on the same line as these letters in Section 2. Now FIf c occurs 
4 times m the message. The digraph it represents must be one of the 
following* 


DF, 

DG, 

DH, 

DI, 

DJ, 

DK 

OF, 

OG, 

OH, 

01 , 

OJ 

IF, 

IG, 

IH, 

II, 

IJ, 

IK 

TK, 





OF, 

JG, 

JHj 

JI, 

JJ, 

JK 

YF, 

YG, 

YH, 

YI, 

YJ 


l8l 
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Of these the only one likely to be repeated 4 times Is OF, yielding 


PSPIPH 

T II 0 F T H which may be a part of 

CQLQPNFNPNIT CQLQPNFNPNIT 

NOHTHOFTHE . or .SOUTHOFTHE 


In either case, the position of the F m Section 3 
F . L in row 3 Theie are 3 cells intervening 
which G, H, I-J, and K may be inserted. It is not 
G, H, and K are in the keyword as that I should be 
assumed that this is the case, and let the letters 
in the appropriate cells in Section 3 Thus (Fig 


is excellent 
between F and L, into 
nearly so likely that 
in it Let it be 
G, H, and K be placed 
6le) 
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Figure 6le. 


Let the resultant derived values be checked against the frequency dis¬ 
tribution. If the position of H in Section 3 is correct, then the di¬ 
graph ONp, normally of high frequency, should be represented several 

times by HF C . Reference to Fig. f 6o shows HF C to have a frequency of 4. 
And IIMq, with 2 occurrences’, represents flSp. There is no need to go 
through all the possible corroborations. 

P N F N P N 

(l2) Going back to the assumption that T H . . T H is part of the 
expression - r ~ — 

CQLQPNFNPNIT CQLQPNFNPNIT 
.NORTHOFTHE. or .SOUTHOFTHE., 

it is seen at once from Fig 6le that the latter is apparently correct 
and not the former, because LQ C eauals OUp and not ORp. If OSprCQc, this 
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means that the letter C of the digraph CQ c must he placed in row 1 column 
3 or row 2 column 3 of Section 3 • Now the digraph CB c occurs 5 times, 
CG c , 4 times, C^, 3 times, CQq, 2 times. Let an attempt he made to 

deduce the exact position of C in Section 3 aud the positions of B, G, 
and H in Section 4, Since F is already placed in Section 4, assume G 
and H directly follow it, and that B comes before it. How much before? 
Suppose a trial he made. Thus (Fig. 6lf) 


A 

B 

C 

D 

E 



C? 



F 

G 

H 

I-J 

K 



c? 



L 

M 

N 

0 

P 

F 

G 

H 

K 

L 

Q 

R 

S 

T 

U 

M 

8 

P 

Q 


V 

W 

X 

Y 

Z 











A 

B 

C 

D 

E 




N 


F 

G 

H 

I-J 

K 

B? 

B? 

B? 

F 

G 

L 

M 

N 

0 

P 

H 


M 

Q 


Q 

1 

R 

S 

T 

U 






V 

W 

X 

Y 

Z 


Figure 6lf 

' 

By referring now to the frequency distribution. Fig. 60, after a very 
few minutes of experimentation it becomes apparent that the following is 
correct* ' 1 
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Figure 6lg. 
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(13) The identifications given by these placements are inserted in 
the text, and solution is very rapidly completed The final matrix and 
deciphered text are given below. 
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Figure 6lh. 
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d. In the solution of four-square cryptograms, advantage may "be 
taken not only of the general type of digraphic jdiomorphs mentioned in 
subpar. 68e, above, but also of a special type of partial l&iomorphism 
present in any four-square cryptograms involving the use of a matrix in 
which the plain components consist of normal alphabets normally in¬ 
scribed. 22 As an illustration, let the digraphs SO OT (II.) be enciphered 
by means of any four-square having normal alphabets m Sections 1 and 2, 
and it will be found that in the encipherment the initial letter of the 

cipher digraph representing SOp will be identical to the initial letter 
of the cipher digraph representing UTp, regardless of how the cipher com¬ 
ponents are constructed. On this basisa bilef list of specialized 
single-letter patterns have been compiled for use in’the solution of such 
a digraphic system, this list of "four-square digraphic idiomorphs" con¬ 
stitutes Section F of Appendix 3* 

e. It is interesting to note how much simpler the technique of 
analysis is in the case of so-called Inverse four-square ciphers, which 
involve the use of a matrix wherein the ciphertext sections contain normal 
alphabets, the plain components being mixed. For example, referring to 
Fig* 53, suppose that Sections 3 and k are used as the source of the 
plaintext pairs, and Sections 1 and 2 as the source of the ciphertext 
pairs, then ONp=ET c , EHp=GE c , etc. The simplicity of the analytic pro¬ 
cedure will be made clear by the following exposition. 

(1) To solve a message enciphered with an inverse four-square matrix, 
it is necessary to perform two steps. First, convert the ciphertext pairs 
into their plain-component equivalents by "deciphering" the message with a 
matrix in which all four sections contain normal alphabets, this operation 
yields two uniliteral substitution "ciphers", one composed of the odd 
letters, the other of the even letters. The second step is to solve these 
two monoalphabetic portions 

(2) As an example, let us consider the following cipher text, known 
(or assumed) to have been encrypted with a trinome-digraphic 2 ^ system 


If any other knotm plain components were involved, the procedure of 
deriving a list of idiomorphic patterns would be modified to fit the 
particular case. 

p Q 

If the cipher text were being examined "from cryptanalytic scratch", 
the limitations ( 003 - 595 ) of the cipher text when the latter is divided 
into trinomes for examination would have at once indicated that this 
grouping is the one which merits detailed analysis. The test would 
then give an indication of the digraphic nature of the underlying crypto¬ 
graphic treatment. 
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incorporating a four-square matrix similar to that illustrated in Fig. 5$, 
except that the plain-component sections have heen changed 


2 0 3 S 3 
2 7 8 3 5 

20805 
4 6 8 1 8 
2 4 3 0 2 
15 3 13 
4 6 5 3 6 
it 7 8 2 1 

08703 

0 0 7 4 1 


85081 
4 1 8 0 4 
7 4 13 5 

3 3 9 3 0 

83832 
6 1 0 4 1 

6 2 6 6 3 

7 3 9 3 3 
7 0 9 1 4 

3 3 3 8 1 


83450 
5 0 4 1 3 
3 5 4 7 3 
9 13 9 3 

2 8 3 5 9 
0 0 14 4 
44007 
81193 

1 9 3 9 1 

3 3 5 9 3 


2 7 9 3 *1 
27416 
32626 
4 1 1 0 4 

3 8 0 2 2 
10 10 1 
1 8 3 4 5 
It 7 9 2 4 

11607 

3 9 3 4 0 


11503 

3 3 0 9 1 
9 116 0 

4 13 3 1 

6 10 4 3 
8 2 4 0 3 
b l 4 0 2 
0 4 0 3 2 

7 13 7 1 
6 3 5 3 1 


09168 
0 10 9 2 
0 3 2 1 8 
17296 
69130 

3 6 1 6 8 
88152 

4 13 0 6 

5 3 5 9 5 
8 8 13 3 


(3) The first thing to he done is to construct a four-square matrix 
with the known ciphertext sections, and inscribe arbitrary alphabets in 
the pi „ext sections, as follows* 
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Z 




(4) The cipher text is then written in trinomes, and these trinomos 
are “deciphered” by means of the foregoing matrix, yielding the converted 
cipher text as follows: 
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5 





10 





15 

A 

203 

238 

508 

183 

T* 

450 

279 

* 

34i 

* 

150 

309 

168 

278 


1*80 

^50 

fl§ 


ID 

IP 

IF 

IH 

OP 

PB 

MT 

FB 

PH 

IR 

OB 

PIT 

FH 

QD 

TM 

1 

B 

274 

163 

309 

101 

' 092 " 

‘208 

057 

ll3 _ 

535 

473 

326 

269 

116 

003 

218 


PV 

IM 

PH 

BE 

CT 
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CH 
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DA 

IT 

C 

468 

183 

393 

091 

393 

4ll 

044 

-*L 

133 

117 

296 

1 

2^3 

028 

383 

228 

359 


TT 

IH 

TQ 
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TQ 

RM 

ER 

IF 

CU 
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IU 

Db 

TF 

IE 
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380 
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369 

130 

153 
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io4 

100 

144 

101 

018 

240 

336 

168 
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FU 

FF 
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GL 

EE 

♦ 

AE 

KQ 

BE 

DQ 

FU 

MO 

IR 

m 
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465 

366 

266 

344 

007 

183 

450 

l40 

288 

152 

478 

217 

393 

381 

193 


QT 

MU 

MQ 

PT 

CF 

IH 

QD 

FQ 

OM 

HB 

TE 
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TO 

_RF 

IS 

F 

479 

240 

403 

24l 

306 

087 

037 

091 

419 

391 

ID 6 

077 

137 

153 

595 


UE 

FU 

TB 

GU 

MH 

CO 

CM 

BT 

UR 

RQ 

BU 

CD 

HL 

IB 

VY 

G 
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413 

338 

133 

593 

393 

4o6 

353 
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133 
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IN 

IF 






The distributions“of 

the 

letters 

constituting the initial letters and 

final letters of 

the 

converted digraphs afe as follows 





(Initial Letters) 


£ 

_s ^ „ I ^ 

_fiSS ilSH 2= - H jg__ __ _ 

ABCDEFGHIKLMNOPQRSTUVWXYZ 


(Final Letters) 


- ~ ~ % % g 

ABCDEFGHIKLMNOPQRSTUVWXYZ 


(5)pausing straightforward principles of frequency and partial idio- 
morphs, the plain text (beginning with the opening words ENEMY 
RECONNAISSANCE...) is recovered, and the following equivalents are obtain¬ 
ed for the converted cipher letters of the two alphabets 

* -* _L» 

(Initial Letters) C ABCDEFGHIKLMNOPQRSTUVWXYZ^ 

P BRAHMSCDEF I LNOP TUV Y 

“ ~ " r * - - r * “ 

(Final Letters) C ABCDEFGHIKLMNOPQRSTUVWXYZ 

P* WA NERBCDFHIKLMOPQSTUV Y 


^ Uote the ABA pattern of the first word in the message (EKEfiY), made 
patent by the two-alphabet conversion process. Also note the 3-fold 
repetition (representing the plaintext word STOP) which, although hidden 
in the original cipher text, now comes to light. 
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Keyword-mixed sequences directly manifest themselves because the original 
enciphering matrix contained such sequences in Sections 1 and 2, inscribed 
in the same manner as were the arbitrary A-Z sequences which were used for 
the conversion. In fact, the key words of the two distributions might 
have been recovered from an analysis of the "profiles” of the distributions 
above, as described in subpar 5^®. 

(6) The original enciphering matrix is then reconstructed, thus 
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(7) Although the example illustrated was that of a numerical di¬ 
graphic system, it is obvious that this technique of solution also applies 
to literal four-square systems in which the cipher components are known 
sequences. It should be clear to the student the tremendous difference it 
makes when it is possible to convert a digraphic system into a two-alphabet 
system, in a digraphic system, we are plagued by a potential 676 different 
elements in the cipher, whereas in a two-alphabet system we still have 
only 26 elements (in each of two sets, it is true) in the cipher text to 

be solved This principle of conversion of cipher text into a secondary 
cipher text has application in some of the most complex types of crypto¬ 
systems, the student would do well to keep this in mind 

(8) As a further observation on inverse four-square systems, it is 
pointed out that where the same mixed alphabet is present in Sections 3 
and 4, the problem is still easier, sauce the letters resulting from the 
conversion into plain-component equivalents all belong to the same, single 
mixed alphabet, thus such a digrephic system is reduced to an ordinary 
simple substitution cipher. 

f. The solution of cryptograms enciphered by other types of small 
matrices is accomplished along lines very similar to those set forth in 
subparagraph c on the solution of a four-square cipher, this will be 
illustrated in subsequent paragraphs There are, unfortunately, few means 
or tests which can be applied to determine in the early stages of the 
analysis exactly what type of digraphic system is involved in the first 
case under study The author freely admits that the solution outlined in 
subparagraph £ is quite artificial in that nothing is demonstrated in step 
(7) that obviously leads to or warrants the assumption that a four-square 
matrix is involved. The point was passed over with the quite bald state¬ 
ment that this was "from among the various hypotheses that remain to be 
considered"—and then the solution proceeded exactly as though thisjrere 
hypothesis had been definitely established For e/ample, the very first 


RESTRICTED 


188 




REF ID:A56892 


REOTRICTED 

x 

results obtained were based upon assuming that a certain 4-letter repe¬ 
tition represented the word STOP and immediately inserting certgin letters in 
appropriate cells in a four-square matrix with normal sequences in Sec ¬ 
tions 1 and 2 . Several more assumptions were built on top of 'that, and 
very rapid strides were made 'What if it had not been a four-square 
matrix at all? What if it had been some other type of not readily identi¬ 
fiable digraphic system? The only defense that can be made of what may 
seem to the student to be purely arbitrary procedure based upon the 
author's advance information or knowledge is the following. In the first 
place, in order to avoid making the explanation a too-long-drawn-out 
affair, it is necessary (and pedagogical experience warrants) tnat certain 
alternative hypotheses be passed over in silence. In the Second place, 
it may now be added, after the principles and procedure have been eluci¬ 
dated (which at this stage is the primary object of this text) xhat if 
good results do not follow from a first hypothesis, the only thing the 
cryptanalyst can do is to reject that hypothesis and formulate a second 
hypothesis In actual practice Ipe may have to reject a second, third, 
fourth, •••nth hypothesis. In the end he may strike the right one—or he 
may not. There is no assurance of success in the matter In the third 
place, one of the objects of this text is to show how certain crypto¬ 
systems, if employed for military purposes, can readily be broken down 
Assuming that some type of digraphic system is in use, and that daily 
changes in key words are made, it is possible that the traffic of the 
first day might give considerable difficulty in solution if the specific 
type of digraphic system were not known to the cryptanalyst But by the 
time two or three days' traffic had accumulated it would be easy to solve, 
because probably by that time the cryptanalytic personnel would have suc¬ 
cessfully analyzed the cryptosystem and thus learned what type of matrix 
or table the enemy is using 


70 Analysis of two-square matrix systems —a Cryptosystems in¬ 
volving either vertical two-square or horizontal two-square matrices may 
he Identified as such and solved by capitalizing on the cryptographic pe¬ 
culiarities and idlosyncracies of these systems It will be noted that, 
considering the mechanics of the cryptosystems, in vertical two-square 
matrices employing the normal enciphering conventions,^5 exactly 20$ of 
the 625 "possible" plaintext digraphs will be "transparent" (i.e , seif- 
enciphered) in cipher text, in horizontal two-square systems, exactly 2C$ 
of the 625 digraphs will be characterized by an "inverse transparency" 


25 -- 

That is, for vertical two-square systems, digraphs are self-enciphered 

1 2 

if ©„ and 6_ fall in tne same column in the matrix, and, for horizontal 
^ 1 2 

two-square systems, if ©;: and © are in the same row, the ciphertext dl- 

xr Jr 

graphs are the reversed plaintext digraphs 
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(i.e., enciphered by the same digraphs reversed). Therefore, if an 
examination of a cryptogram or a set of cryptograms discloses a goodly 
portion of what appear to be direct transparencies (cipher digraphs which 
could well be plaintext digraphs),'it may then be assumed that a vertical 
two-square matrix has been need for the encryption. On the other hand, 
if a large number of cipher digraphs could be "good" plaintext digraphs 
if the positions of the letfcers were reversed , then it may be assumed that 
the cryptosystem involved a horizontal two-square matrix Sometimes 
skeletons of words or even of whole phrases are self-evident in such 
cipher text, thus affording an easy entering wedge into the cryptosystem 

b An example will best serve to illustrate the techniques of 
identification and subsequent solution of a two-square matrix cipher. 

The following naval message Is to be studied 
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V Y 0 I G RSLXX 

Preliminary steps in analysis are made according to the procedures already 
described in this text, and the hypothesis of monographic, uniliteral en¬ 
cipherment (with either standard or mixed cipher alphabets) has been re¬ 
jected Multiliteral substitution, or digraphic substitution, comes next 


Although 625 "possible" plaintext digraphs are involved, the identity 
of digraphs actually used in plain text limit this figure considerably 
Furthermore, the frequencies of the plaintext digraphs actually used come 
into consideration, in conjunction with the location of the letters of 
these digraphs in any particular two-square matrix Thus, from the crypt¬ 
analyst's standpoint, there are "excellent" two-square matrices giving a 
high self-encipherment rate for high frequency plaintext digraphs, and 
there are "poor" two-square matrices which have a potentially high self- 
encipherment rate only foi those low frequency plaintext digraphs which 
may not occur at all in a given cryptogram 
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into consideration The cipher text is written m digraphs, as follows 


5 10 15 



Noting the 8-letter repetition 90 letters apart, the 6-letter repetition 
16 letters apart, and the 4-letter repetition at an interval of 220 let¬ 
ters, and that those repetitions begin on odd letters and end on even 
letters, credence is given to the grouping of the cipher text into pairs 
of letters. A digraphic distribution is then made, illustrated in Fig- 

63 
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Figure 63 


c The 152, is most satisfactory when compared with ^ (107) 

and ( 23 ) Since the cryptogram has all the earmarks of a digraphic 

cipher, and no manifestations are found to support the hypothesis of a 
multiliteial system, the next problem is the specific determination of the 
particulai kind of digraphir system involved It may be noted tnat there 
are quite a few digraphs in the cipher text which resemo3e good plaintext 
digiaphs, proportionally more so than, for instance, m the cryptogram in 
subpar 69 c, the cryptologic finger points to the possibility of a two- 
square system However, since the words "good digraphs" are serrantical3y 
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elusive , lei us attempt to determine statistically wnethci or not a tvo- 
eqUare System might be involved and, if a two-sqUafe , whether it is more 
probably a Vertical or a hori^ohtaj tvo-squar§ 

d First« fof the pUi*po§§ 8f determining whether (t direct trans- 
paraaaies' 1 or "inverse tran§paP§h2ids 1 ' predominate in this cryptogram.) the 
digraphs df the distribution in Fig 63 will be set down in tabular form, 
With an indicat Jon of their frequehcjf ih the cryptogram, and with data 
relative to the probability of these “digraphs as plaintext digraphs and 
a§ plaintext digraphs blien feVersddT In the table on page 194 , col (l) 
is a listing of the ciphertext digraphsj 651 ( 2 ) is the frequency of the 

ciphertext digraph as it occiif §"in the cryptogram, cdl ( 3 ) is the 3 oga~ 
rithirt df the^theoretical plaihtext frequency of the particular digraph 
(from Sable If, Appendix 2), Col (4) represents the products of the 
entries in cols ( 2 ) and ( 3 ); col» (5) is the logarithm of "the theoretical 
plaihtext frequency of the reversed digraph (ffoBi fable 15, Appendix 2), 
afld col. (6) represents the products df the Shtries in cols. (2) and (*>) • 
From this, the Siild of the valueS in col. (4), 5S*3^> is taken to be the 
,( direct tranSp&.rehSV'' Value, §M the shift 5f the Values ih 551» (6), 63 .02, 
i§ taken to be the "lnVbrSe traflspdfdfidy” value. fhiis, since this par¬ 
ticular cryptogram has'In 11 inverse transparency*' Value which is higher 


0*7 ™* ■" 4 j- 

(The test to he described id the following subparagraphs is based on 
ah evaluation 8f those instances therein the Observed frequency of any 
particular Ciphertext digraph approximates the frequency with wkicn the 
particular digraph, or its reVSFBal, wobld be expeSthd to dedur Df con¬ 
sidered as a plaintext digraph Any §Uch corfSlatidh Which occurs m a 
fouf-s~qliare or Playfair Cipher, or in a 6fyptdgfain produced by a large 
randomized digraphic table, is purely accidental becaUsd it is not a 
rishlt of the mechanics 6f the system HoveVer. in two-sqUare ciyptografnb 
sUch correlation is caused by the fllchaftiSS 6f the system in the encipher¬ 
ment of 20$ of the possible plaintext digraphs, aiid these causal instances 
of correlation 6ccur in addltioh to Any accidental instances dhich may 
aris§ in the encipherment of the remaining 80$ Thus, if a digraphie 
cipher Exhibits mdrdly the random feXpdStation df corrdlatioii both when the 
particular ciphertext digiaphs are cdhsidtered as they afe' and when their 
reVeFSals are' considered, the cryptogram may be assbilled to involve a 
system 6ther than two-sqUafe If a digidphic cipher exhibits more than 
the random expectation of correlation, hither When the parVicblar digraphs 
§±e considered direct or wheri coHsldferdd reVefedd, it Way be assWdd to 
iriV&iVe tw8-sqUahe SHciphermdnt, and the pahticblar cSSsideration—that of 
the digraphs direct of that of the digraphs reversed—vhnch gives flse xo 
the greater degree of coFFeiatioH indicates whether the cryptogram involves 
a Vertical two-sejiiare 6r & horizBhtal two-sqddre, respectively 
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(1) 

(2) 

(3) 

(4) 

(6) 

(6) 

(1) 

(2) 

(3) 

(4) 

(6) 

(3) 

tt) 


(8) 

(4) 

(5) 
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1 
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*78 

0,78 

HL 

2 
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0,26 

,13 

0,28 

OS 

3 

.61 

1,83 

.62 
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AN 

1 

.89 

0,89 

.72 

0.72 

HS 

1 

,38 

0,38 

.72 

0.78 

PO 

1 

.64 

0.64 

,72 

,0,72 

AP 

1 

,68 

0.58 

,61 

0.61 

IE 

1 

.69 

0,60 

.73 

0,73 

FT 

1 

.51 

0,61 

.26 

0.26 

AQ 

1 

.00 

0.00 

.00 

0.05 

XH 

1 

,00 
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.77 
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QW 

1 

.00 
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.00 
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BB 

2 

.00 
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.00 
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.48 

1.44 

.46 

1,35 

OX 
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.00 
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,00 
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BP 

1 

.00 

0.00 

.00 

0,00 

XR 

2 

.73 

1.46 

.75 

1,50 

RA 

4 

#80 

3.20 

,82 

3.28 

BQ 

1 

1 

0.00 

.00 

0.00 

IS 

1 

.78 

0.78 

.77 

0,77 

KB 

1 

,26 

0,26 

.26 

0,26 

CE 

3 

<6 

2,28 

,76 

2.28 

KP 

1 

.00 

0.00 

.00 

0,00 

K0 

2 

,63 

1.06 

.38 

0,70 

CR 

2 

,38. 

0,76 

.63 

1.06 

KR 

1 

.00 

0.00 

,13 

0.13 

RG 

1 

.48 

0.48 

.42 

0.42 

CW 

1 

.13 

0,13 

.00 

0.00 

KS 

2 

.13 

0,26 

.13 

0,20 

Kl 

1 

.76 

0.75 

,73 

0.73 

DA 

2 

.76 

1 52 

,73 

1.46 

LC 

4 

.33 

1.32 

.42 

1,68 

RK 

. 1 

.13 

0.13 

.op 

0.00 

DD 

1 

.51 

0.61 

.61 

0.51 

LN 

2 

.13 

0.26 

.42 

0.84 

SE 

1 

,84 

0.84 

.80 

0.86 

DE 

1 

.77 

0.77 

.88 

0.88 

10 

1 

.59 

0.50 

.'67 

0.67 

SI 

1 

,77 

0.77 

.78 

0.78 

DI 

4 

.73 

2 92 

45 

1.80 

LP 

1 

.33 

0.33 

.59 

0.59 

SL 

1 

.26 

0,25 

,45 

0.45 

DL 

1 

CO 

CO 

• 

0.33 

.53 

0.63 

nr 

1 

.51 

0,51 

.42 

0.42 

SN 

2 

.38 

0,76 

.71 

1.42 

DT 

1 

.62 

0.62 

.45 

0.45 

MA 

1 

.78 

0.78 

.61 

0.61 

SS 

1 

.67 

0,67 

,67 

0,67 

EE 

1 

.81 

0.81 

,81 

0.81 

MD 

1 

.13 

0.13 

,42 

0,42 

TA 

3 

.74 

2.22 

.83 

2,49 

El 

1 

.73 

0.73 

.59 

0.59 

MO 

2 

55 

1.10 

.72 

1.44 

TI 

1 

.82 

0.82 

73 

0 IS 

EK 

1 

.00 

0,00 

,45 

0.45 

NN 

4 

.51 

2.04 

.61 

2.04 

TQ 

1 

.13 

0.13 

.00 

0.00 

EN 

1 

.99 

0.99 

.87 

0.87 

NO 

7 

.66 

4.62 

.92 

5,74 

UB 

1 

.33 

0# 33 

.25 

0,26 

EQ 

1 

.58 

0,58 

.00 

0.00 

HX 

1 

.00 

0.00 

.13 

0,13 

UC 

1 

,33 

Q, 33 

OO 

CO 

• 

0 38 

ER 

1 

.94 

0.94 

.96 

0.96 

NZ. 

1 

.00 

0.00 

.00 

0,00 

UO 

2 

.13 

0,26 

.79 

I.fa8 

FI 

1 

.80 

0.8Q 

.55 

0.65 

OC 

1 

.51 

0,51 

.80 

0.80 

VY 

1 

.00 

0,00 

.00 

0.00 

GB 

1 

.00 

0,00 

.00 

0,00 

OE 

1 

33 

0.33 

.58 

0.58 

XX 

1 

.00 

0,00 

.00 

0.00 

GL 

2 

.25 

0,50 

.13 

0.26 

OG 

1 

.25 

0.25 

.45 

0,45 

YF 

1 

.56 

0,56 

.13 

0.13 

GP 

1 

.25 

0.25 

.00 

0.00 

01 

3 

,42 

1.26 

.80 

2.40 


125 


58,34 
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GR 

3 

.42 

1.26 

.48 

1.44 

OL 

1 

67 

0 67 

.59 

0.59 








(1) Identity of cipher digraph appearing in the cryptogram. 

(2) Frequency of the particular digraph as it occurs in the cryptogram 

(3) Logarithm of theoretical plaintext frequency of the particular digraph (from Table 15, Appendix 2), 

(4) Product of entries in columns (2) and (3). 

(5) Logarithm of theoretical plaintext frequency of the digraph^ reversal (from Table 15, Appendix 2). 

(6) Product of entries in columns (2) and (5). 
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than the "direct transparency" value, it may be assumed^ to involve a 
horizontal two-Square—if, indeed, two-square encipherment has been em¬ 
ployed It is now for us to establish whether or not this latter is the 
case, and this will be done by determining whether or not the foregoing 
observed value, 63 02, is representative of the degree of tiansparency 
which may be expected in a horizontal two 1 -square cipher, (if the "direct 
transparency" value had been the higher of the two, then it would have 
been more probable that a vertical two-square were involved, and it would 
be necessary to determine whether or not this observed value was repre¬ 
sentative of the degree of transparency expected in a vertical two-square 
cipher) 

e The observed "inverse transparency" value (selected m this case 
because at is the higher observed value) will be compared with the value 
expected from a horizontal two-square cryptogram of the same size, and if 
this observed value is a3 great as or greater than the transparency value 
expected for horizontal two-squares, the cryptogram may be considered to 
be a horizontal two-square cipher, if the observed value is lower than 
the expected two-square value, decision will have to be suspended The 
transparency value expected in a horizontal two-square cipher containing 
N digraphs is computed by multiplying N by . 3388 ? which in this case 


““28- 

Actually, if the two-square hypothesis is made, the difference be¬ 
tween the horizontal two-square value and the vertical two-square value 
will indicate the degree of probability of the higher score over the 
lower In this case, the difference of 4 68 (= 63 02 - 58 34), which 
represents a difference of log scores , is equivalent to an overwhelming 

ratio of 100 billion to 1 (i.e , 224 ^ to l) in favor of the hypothesis 
of a horizontal two-square The foregoing computation involves an aspect 
of mathematics which will be given detailed treatment in Military Crypt¬ 
analysis, Part III 

29 _ 

For the benefit of the student with a background in statistics, it 

is pointed out that by abiding by the stipulation "as great or greater", 
some cryptograms which actually are the result of two-square encipherment 
may be rejected by this stipulation, but it will insure that only a rela¬ 
tively few non-two-square cryptograms will be accepted A better approach 
of a statistical nature would involve, first, computing the expected 
value for non-two-squares as "well as that for two-squares Then, any 
observed value falling below the expected two-square value could he 
expressed in terms of the number of standard deviations (i e , the 
sigmage) from this expected two-square value and, from the expected non - 
two-square value. Finally, the particular expected value which would be 
considered as significant would be the one from which the observed value 
differed by the smaller number of standard deviations The concept of 
standard deviation will be treated in Military Cryptanalysis, Part III. 
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yields 42 35 (= 3388 x 125).3° The observed value for the cryptogram, 

63 02, is much higher than the expected value, 42 35* Thus, it has been 
proven statistically that the cryptogram at hand Involves two-square 
encipherment, particularly, horizontal two-square encipherment 

f Having now proved that the cryptogram at hand is a horizontal 
two-square cipher, the next step is to assume some plain text in the 
message, guided by probable inverse transparencies ( inverse because the 
system has been identified as a horizontal two-square) in the cipher text. 
Referring to the work sheet in Fig 62, the repeated sequence at B9 and 
E9 is assumed to represent the plain text TA SK F0 RC (E-), on the basis 
of KS c =SKp, and CR c =RCp The plaintext-ciphertext values are now 


30 

In the case of vertical two-squares, N would be multiplied by the 
constant 3610 The mathematical considerations underlying this test and 
their -proofs (involving Bayes' theorem and Bayes* factors) are beyond the 
scope this text, however, for the benefit of the mathematician, the 
derivation of the foregoing constants 3 s explained below, along with the 
derivation of the constant used for computing the expected transparency 
value for non -two-squares In the formulas, below, 

£ 

s the summation over all digraphs AA-ZZ 

*AB - frequency of a given digraph AB as found m Table 6A, 
Appendix 2 

^AB “ logarithm (to the base 224) of the frequency of a given 
digraph AB as found in Table 15, Appendix 2 

For vertical two-squares, 


k : 2. oc 
AB* AB 


[.8o(.ooi5) -fr 36 io 


For horizontal two-squareB, 


k = 2 <« 

AB BA 


f 80( 0015) + ■ £? L 3388 
L 5000 J 


For non -two 
k 


square digraphic 

= - 2737 

676 


sys terns, 
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31 ~~~ 

recorded in a skeleton reconstructibn diagram as illustrated in Fig. 
64a. At A3, the assumption of (-R) EC ON NA IS SA NC (E-) is tossed off 
without much ado, since four of the six digraphs concerned are trans¬ 
parent. The plain-cipher relationships from this assumption are added 
to the reconstruction diagram, as shown in Fig. 64b. Continuing in this 
vein, the plain text (-A) IR CR AF (T-) is inserted at A10, and the plain 
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Figure 64b. 


text (-B) AT TL ES HI (P-) is inserted at J3, the successive cumulative 
reconstruction diagrams for these two assumptions are shown in Figs. 64c_ 
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Figure 64c. 


During the reconstruction of the squares of the matrix, the student 
should keep clear in his skeleton diagram which letters are in the same 
row, and which are in the same column. It will he found expeditious to 
draw a dividing line (either horizontal or vertical, depending on the 
type of two-square matrix involved) on the page to keep the elements of 
the two squares independent, recording the values which are in the same 
row or column and writing down the letters as they are assumed. In the 
early stages of this process the student must exercise care in recording 
the letters so that no false relationships are formed, in other words, 
the values should he written down so that they are not in the same row or 
column with any letters other than those with which they are known to he 
related. This will entail spreading the work rather widely over the page 
initially, then gradually telescoping and reducing the size of the re¬ 
construction diagram as the work progresses, until in the end it will he 
reduced to a concise matrix of two 5x5 squares. 
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and 64d below. It is to be noted that at J7, 0C r> =P8p; but since in 

Fig. 64d it has already been determined that 0C c =9Sp, then 0C c must equal 

PS aking the word BATTLESHIPS rather than in the singular. 

* 

g. At this point the partially filled-in work sheet will look as 
follows: , i, " > * 
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Skeletons of additional plain text, such as the word OUR at Al, PRESENCE 
OF ENEMY at Bl, PROBABLE at Dl, ATTACK ON OUR INSTALLATIONS at DIO, 
CARRIER at Fl4, and VESSELS at Jl, may now clearly be seen The complete 
recovery of the plain te-ct follows, and the reconstruction diagram is 
completed and telescoped into the form shown In Fig 64e Since phe¬ 
nomena of keyword-mixed sequences are observed, the rows' and columns of 
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Figure 64c " " ‘ Figure 64f 

Fig 64e are permuted to yield the original two-square matrix as shown 
in Fig 64f - - 

i > * 

h The solution of vertical two-square systems follows analogous 
lmes7 with the necessary modifications of the reconstruction diagram in 
consonance with the difference in mechanics between horizontal and verti¬ 
cal two-square systems 

I A few additional remarks concerning the test applled m subpars 
d end e, above, eire in order First, the exceptionally high transparency 
value observed in this cryptogram is a direct result of the very favor¬ 
able manner in which the keyword-mixed sequences m the two squaiOs inter¬ 
act, m the foregoing cryptogram, 47 of the 325 digraphs present (approx 
38 io) were inverse transparencies It is also pointed out that, although 
some actual two-square cryptograms may be rejected by that portion of the 
test which was described in subpar e, the other phase of the test (de¬ 
scribed in subpar d)—by which one may determine whether a cryptogram is 
more probably a vert j cal two-square encipherment or more probably a hori¬ 
zontal two-square encipherment--is extremely sensitive and highly accu¬ 
rate The foregoing statistical method is not merely valuable per se as 
an application of cryptomathematics in the analysis of two-square matrix 
systems, but is included as being illustiative of the general principles 
of special, techniques that may be developed in the attack on any particu¬ 
lar cryptosystem, the mechanics of which are known to the cryptanalyst 
The field of actual operational cryptanalysis is replete with special 
methods of attack of this nature ~ 

71 Analysis of Playfair cipher systems --a Of all digraphic 
cryptosystems employing small matrices, the one which has been most 
frequently encountered is the Playfair cipher Certain variations of 
this cipher have been incorporated m several complex manual ciphers 
used in actual operational practice, because of this it is important that 
the student gain familiarity with the methods of solution of the classic 
Playfair system 
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b The first published solutions^ for this cipher are quite 
similar basically and vary only in minor details The earliest, that by 
Lieut. Mauborgne (later to become Chief Signal Officer of the U.S. Army), 
used straightforward principles of frequency bo establish the values of 
three or four of the most frequent digraphs Then, on the assumption 
that in most cases in which a keywoid appears on the first and second rows 
the last five letters of the normal alphabet, VWXYZ, will rarely be dis¬ 
turbed in sequence and will occupy the last row of the square, he "jug¬ 
gles” the letters given by the values tentatively established from 
frequency considerations, placing them in various positions in the square, 
together with VWXYZ, to correspond to the plaintext-ciphertext relation¬ 
ships tentatively established A later solution by Lieut Frank Moorman, 
as described in Hitt’s manual, assumes that in a Playfair cipher prepared 
by means of a square in which the key word occupies the first and second 
rows, if a digraphic frequency distribution is made, it will be found 
that the letters having the greatest combining power are very probably 
lette >f the key A still later solution, by Lieut Commander Smith, 
is peihaps the most lucid and systematized of the three He sets forth 
in definite language certain considerations which the other two writers 
certainly entertained but failed to indicate 

£ The following details have been summarized from Smith's so¬ 
lution 

(1) The Playfair cipher may be recognized by virtue of the fact that 
it always contains an even number of letters, and that when divided into 
groups of two letters each, no group contains a repetition of the same 
letter, as UN or EE Repetitions of digraphs, trigraphs, and polygraphs 
will be evident m fairly long messages 

(2) Using the square^ shown in Fig 65 , there arc two general cases 
to be considered, as regards the results of encipherment 
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Figure 65 


32 

Mauborgne, Lieut J. 0., U S.A An advanced problem in cryptography 
and its solution, Leavenworth, 1914 , 

Ilitt, Captain Parker, U.S.A Manual for the solution of military 
ciphers , Leavenworth, 1918 

Smith, Lieut Commander W. W., U.S.N. In Cryptography by Andre 
Langie, translated by J.C.H. Macbeth, New York, 1922 

33 

The Playfair square accompanying Smith's solution is based upon the 
key word BANKRUPTCY, "to be distributed between the first and fourth lines 
of the square" This is a simple departure from the original Playfair 
scheme in which the letters of the key word aie written from left to right 
and in consecutive lines from the top downward 
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Case 1 Letters at opposite comers of a rectangle T?ho following 
illustrative relationships are found 

TfUIic 


tp-EXe 
2M3e 


Reciprocity and reversibility 


34 


Case 2 Two letters in the same row or column. The following il¬ 
lustrative relationships are found 

Wv-Mc 

But RKp does not = AK C , nor does KNp=M c . 

Reversibility only 

v * 

(3) The foregoing gives rise to the foil)owing 

Rule I (a) Regardless of the position of the letters in the 
square, if 


1 2l3 4, then 

2 i:u 3 

> * 

This rule is of particular aid in selecting probable words in the 
solution of Playfair ciphers, as will be shown shortly 35 

(b) If 1 and 2 form opposite corners of a rectangle, the 
following equations obtain 

„ , - 1 2=3 4 

2 1 = 4 - 3 

3 4=1 2 

4 3=2 1 


oft 

J By way of explaining what is meant by reciprocity and by reversi ¬ 
bility, in the case of digraphic systems, the following examples are 
given THp=YF c and YFp^Hc constitute a reciprocal relationship, 

THp=YF c and HTpIFY c constitute a reversible relationship 
qc 

J In this connection, a list of frequently-encountered words and 
phrases which contain reversed digiaphs (so-called "ABBA patterns") has 
been compiled and is included as Section E, "Digraphic ldiomorphs. 
Playfair", in Appendix 3 
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(H) A letter considered as occupying a position in a row can be com¬ 
bined witli bub four other letters in the same row, the same letter con¬ 
sidered as occupying a position in a column can be combined with but four 
other letters in the same column. Thus, this letter can be combined with 
only 8 other letters all told, under Case 2, above But the same letter 
considered as occupying a corner of a rectangle can be combined with 16 
other letters, under Case 1, above Smith derives from these facts the 
conclusion that "it would appear that Case 1 is twice as probable as 
Case 2". He continues thus (notation my own)* 

"Now in the square, note that 

ANp sNKU IFL^, 

GNpsFKc EMpCFL c 

OICsMK„ also Mj,=FP^ 

CMpSTK/. EW1SFV„ 

XN£=^K C EF p =FO c 

"Fiom this it is seen that of the Qk equations that can be formed 
when each letter of the square is employed either as the initial or final 
letter of the group, five will indicate a repetition of a corresponding 
letter of plain text 

"Hence, Rule II After it has been deteimined, in the equation 

1 2=3 that, say, ENp=FA c , there is a probability of one in five that 

any other group beginning with F c indicates le p , and that any group 
ending in A c indicates ©Np 3° 

"After such combinations as ERp, QRp, and ENp have been assumed or 

determined, the above rule may be of use m discovering additional di¬ 
graphs and partial words” 


w 


There is an error m this reasoning 
Cose 


FB 0 =DN P 
FD =EH 
FI =DM 
FU =DT 
FS =DW 
FA =EN 


Take, foi example, the 24 equations having F as an initial letter 

Case 


rE=CD 

FL=EM 

FP=ET 

FV=EW 

FN=NY7 

™=NF 


Cose 

2 

2 

1 

2 

1 

1 


FT=NM 

FW=NT 

rK=GN 

FG=Er 

F0=GM 

FC=GT 


FX=GW 

FR=HN 

FH=EG 

FQ=HM 

FY=HT 

FZ=HW 


Here, the initial kttcr F 0 represents the following initial letters of plain-text digraphs 


DQ P , EO, N0 P> G0 P , and He p 

It is seen that F c represents D PI N D G p , H t 4 tunes each, and E pr 8 times Consequently, supposing that it has 
been determined that FA 0 = EN P , the probability that F 0 will represent E p is not 1 in 5 but 8 m 24, or 1 in 3, but 
supposing that it 1ms been determined that FW 0 =NT VI the probability that F 0 w ill represent N p is 4 in 21 or 1 in 6 
The difference In these probabilities is occasioned bj the fact that the first instance, FA^ENp corresponds to a 
Case 1 encipherment, the second instance nV q =NT p , to a Case 2 encipherment But there is no waj of knowing 
Initially, and without other data, whether one is dealing with a Case 1 or Case 2 encipherment Qulj as an 
approximation then, fore, maj one *av that the piobabiht\ of F e repre anting a given 0 P is 1 in 5 A probability 
of 1 in 5 ib of almost trivial importance in this situation "ince it icpresents -uch a long shot ' foi success Tilt 
following rule nurht be preferable If the equation 12 = 31 has been established where all the lettcib represented 
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Rule III In the equatiofi 1 2r3*4, 1 and 3 can never be identical, 
nor can 2 and 4- ever be identical Thus, ANp could not possibly be 
represented by AY e , nor could ERp be represented by KR C This rule is 
useful in elimination of certain possibilities when a specific message 
is being studied _ • 

Rule IV In the equation 1.2pX3»4 c , ^ 2 and 3 are identical, the 
letters are all m the same row or column, and in the relative order 

1- 2-4 from left to right or top to bottom, respectively In the square 

shown, ANp =M C and the absolute order is AM The relative order 1-2-4 
includes five absolute orders which are cyclic permutations of one 
another Thus* AM.., M. .A, K..AN, ..AM, and .AM 

Rule V In the equation 1 2p=3*4 c , if 1 and 4 are Identical, the 
letters are all in the same row or column, and in the relative order _____ 

2- 4-3 from left to right or top to bottom In the square shown, KKpdRKc 

and the absolute order is MR The relative order 2-4-3 includes five 
absolute orders which are cyclic permutations of one another Thus 
MR.., KR .N, R..M, ..MR, and .MR.. 

Rule VI "Analyze the message for group recurrences. Select the 
groups of greatest recurrence and assume them to be high-frequency di¬ 
graphs. 37 Substitute the assumed digraphs throughout the message, test¬ 
ing the assumptions in their relation to other groups of the cipher The 
reconstruction of the square proceeds simultaneously with the solution of 
the message and aids in hastening the translation of the cipher". 


by 1, 2, 3, and 4 are different, then there is a probability of 4/5 that a Case 1 encipherment is involved Conse¬ 
quently, if at the same time another equation, 3 6=5 2, has been established, where 2 and 3 represent the samo 
letters as in the first equation, and 5 and 6 are different letters, also different from 2 and 3 there is a probability 
of 16/25 that the equation 1 6 = 5 4 is valid or if at the same time that the equation 12 = 34 has been deteimined, 
the equation 1 6=5 4 has also been established, then there is a probability of 16/25 that the equation 3 6 = 5 2 is 

13 84 30 52 10 64 

valid (Check this by noting the following equations based upon Tig 25a CE = PG, PH = YE, CH=YG Note 
the positions occupied in Fig 25a by the letters involved) Likewise, if the equations 1 2=3 4 and 16 = 3 5 
have been simultaneously established, then there is a probability that the equation 2 5 = 4 6 is valid, or if the 
equations 12 = 34 and 2 5 = 4 6 have been simultaneously established, then there is a probability that the 

12 34 10 35 25 40 

equation 2 5 = 4 6 is valid (Check this by noting the following equations CE = PG, CA = PK, EK = GA, note the 
positions occupied in Fig 25a by the letters involved ) Howevei, it must be added that the^e probabilities are 
based upon assumptions which fail to take into account any considerations whatever as to frequency of letters 
or specificity of composition of the matrix For instance, suppose the 5 high-frequency letters E, T, R, I N all 
happen to fall in the same row oj column m the matrix, the number of Case 2 enCipheiments would be much 
greater than expectancy and the probability that the equation 12 = 34 represents a Case 1 encipherment falls 
much below 4/5 

37 

A more accurate guide to the determination of the plaintext equiva¬ 
lents of hngh-frequency cipher digraphs would involve the consideration 
of the difference m frequency of a particular digrapn and its reversal 
Thus, an example of a high-frequency 80p which is also high-frequency m 

its reversal, is REp, an example of a high-frequency 88p which is rarely 
found in its reversed form, is THp “ 
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d (l) When solutions for the Playfair cipher system were first 
developed, based upon the fact that the letters were inserted m the 
cells in keyword-mixed order, cryptographers thought it desirable to 
place stumbling blocks in the path of such solution by departing from 
strict, keyword-mixed order One of the simplest methods is illustrated 
in Fig 65 , wherein it will be noted that the last five letters of the 
keyword proper are inserted in the fourth row of the square instead of 
the second, where they would naturally fall. Another method involves 
inserting the letters within the cells from left to right and top down¬ 
ward but using a sequence that is derived from a columnar transposition 
instead of a keyword-mixed sequence Thus, using the keyword BANKRUPTCY. 

215 4 79683 10 

BANKRUPTCY 
DEFGHILMOQ 
S V W X Z 

Sequence AEVBDSCOKGXNFWPLRHZTMUIYQ 
The Playfair square is as follows 


A E V B D 
S C 0 K G 
X N F W P 
L R II Z T 
M U I Y Q 


Figure 66a 

(2) Note the following three squares 


0 K G S C 
FHPXN 
H Z T L R 
I Y Q M U 
V B D A E 


Figure 66b Figure 66c 

At first glance they all appear to be different, but closer examination 
shows them to be cyclic permutations of one another and of the square m 
Fig. 66a. They y-eld identical cryptographic equivalents in all cases 
However, if an attempt be made to reconstruct the original key word, it 
would be much easier to do so from Fig 66a than from any of the others, 
because in Fig 6ba tne original keyword-mixed sequence has not been 
disturbed as much as in Figs 66b, c, and d In woikmg with Playfair 
ciphers, the student should be On The lookout for such instances of 
cyclic permutation of the original Playfair square, for during the course 
of solution he will not know whether he is building up the original or an 
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Figure 66d 
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equivalent cyclic permutation of the original matrix, Usually only after 
he has completely reconstructed the matrix will he he able to determine 
this point 

e (l) The steps jn the solution of a typical example of this 
cipher will now he illustrated Let the message he as follows 
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C 

t 

N 

0 

T 

Y 

0 

L 

G 

X 

.X 

I 

I 

H 

G 

T 

M 

S 

M 

x_ 

C 

X 

X 

0 

T 

c 

x_ 

0 

jr 

T 

C 

Y 

A 

T 

E 

X 

H 

F_ 

A 

C 

x' 

* 

x_ 

_C_ 

P 

Z > 

H. 


_H 

Y 

C 

T 

X 

w 

L 

z 

T 

s 

G 

p 

z 

T 

V 

Y 

W 

C 

E 

T 

W 

G 

c 

C 

M 

B 

H 

M 

Q 

J 

Y 

X 

Z 

P 

W 

G 

R 

T 

I 

V 

u 

X 

p 

u 

M 

Q 

R 

K 

M 

W 

C 

X 

T 

M 

R 

S 

W 

G 

H 

B 

K 

X 

C 

JP 

X 

0 

T 

jC 

X 

0 

X 

M 

I 

p 

Y 

D 

N 

F 

G 

K 

I 

T 

c 

0 

L 

X 

u 

E 

T 

P 

X 

L 

X 

F 

S 

R 

s 

U 

z 

T 

D 

B 

H 

0 

Z 

I 

G 

X 

R 

K 

I 

X 

Z 

p 

P 

V 

z 

I 

D 

U 

H 

Q 

M 

0 

T 

K 

T 

K 

C 

c 

H 

X 

X 






















(2) Without going through the preliminary tests in detail, with 
which it will he assumed that the student is now familiar,^ the con¬ 
clusion is reached that the cryptogram is digraphic in nature, and a 
digraphic frequency distribution is made (Fig 67) 


3S 


See par 


69c 
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i 

> 

s 

4 

s 

« 

r 

« 

l 

10 

u 

12 

13 

14 

16 

A 

VT 

QE 

UH 

10 

FT 

CH 

XS 

CA 

KT 

VT 

RA 

ZE 

VT 

AG 

AE 

B 

OX 

TY 

MH 

CR 

LZ 

ZT 

QT 

DU 

MC 

YC 

XC 

TG 

MT 

YC 

ZU 

C 

SN 

OP 

DG 

XV 

XS 

CA 

KT 

VT 

PK 

PU 

TZ 

PT 

WZ 

FN 

BG 

D 

PT 

RK 

XI 

XB 

PR 

ZO 

EP 

UT 

OL 

ZE 

KT 

TC 

SN 

HC 

QM 

E 

VT 

RK 

MW 

CF 

ZU 

BH 

TV 

YA 

BG 

IP 

RZ 

KP 

CQ 

FN 

LV 

F 

OX 

0T_ 

UZ 

FA 

CX 

XC 

PZ 

XH 

_CY 

NO 

TY 

OL 

GX 

XI 

IH 

G 

TM 

SM 

XC 

PT 

OT 

CX 

OT 

TC 

YA 

TE 

XH 

FA 

CX 

XC 

PZ 

II 

( XH 

YC 

TX 

WL 

ZT 

SG 

PZ 

TV 

YW 

CE 

TW 

GC 

CM 

BH 

MQ 

J 

YX 

ZP 

WG 

RT 

IV 

UX 

PU 

MQ 

RK 

Mil? 

CX 

TM 

RS 

WG 

HB 

K 

XC 

PT 

PT 

CX 

OT 

MI 

PY 

DN 

FG 

KI 

TC 

OL 

XU 

ET 

PX 

L 

XF 

SR 

SU' 

ZT 

DB_ 

HO 

ZI 

GX 

RK 

IX 

ZP 

PV 

ZI 

DU 

HQ 

M 

OT 

KT 

KC 

CH 

XX 


- 

- 






- 


(3) 

The following three fairly 

lengthy repetitions 

4 - 

w 

are 

noted: 




Lines 

F 

OT 

UZ 

FA 

CX 

XC 

PZ 

XH 

CY 

NO 







G 

TE 

XH 

FA 

CX 

( — 
XC 

PZ 

XH 

YC 

TX 






r 

A 

FT CH " 

XS 

CA 

KT 

VT 

RA 

ZE 








V 

DG 

XV 

XS 

CA 

KT 

VT 

PK 

PU 








G 

TM 

SM 

X<? 

PT 

OT 

CX 

OT 

>> 

TC 








K 

WG 

HB 

XC 

PT 

OT 

CX 

OT 

MI_ 















j} 











* 





— 


» 




The first long repetition, with the sequent reversed digraphs CX and £C 
immediately suggests the word BATTALION X see Section E, Appendix 3)j> 
split up into -B AT TA LI ON and the sequence containing this repetition 
in lines F and G becomes as follc?v[s , , r # 

Line F OX OT UZ FA CX XC PZ XH CY NO TY 

- B AT TA LI ON 

Line G YA TE XH FA CX XC PZ XH YC TX NL 

B AT TA LI ON 

(4) Because of the frequent use of numerals before the word BAT¬ 
TALION (as mentioned in Section B of Appendix 4) and because of the ap¬ 
pearance of ONp before this word an line G, the possibility suggests 
itself that the word before BATTALION in line G is either ONE or SECOND. 
The identical cipher digraph FA in both cases gives a hint that the word 
BATTALION in line F may also be preceded by a numeral, if ONE is correct 
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in line G, then THREE is possible in 3 ine F On the other hand, if 
SECOND is correct ixTline G, then THIRD is possible in line F Thus 


Line F 

. OX 

OT 

uz 

FA 

CX 

XC 

PZ 

XH 

CY 

NO 

TY 

1st hypothesis 

— 

TH 

RE 

EB 

AT 

ta' 

LI 

ON 




2nd hypothesis 


TH 

IR 

IB 

AT 

TA 

LI 

ON 




Line G 

YA 

TE 

XH 

FA 

CX 

XC 

FZ 

XH 

YC 

TX 

ML 

1st hypothesis 


•■■■ 

ON 

EB 

AT 

TA 

LI 

ON 




2nd hypothesis 

• -S 

EC 

ON 

IB 

AT 

TA 

LI 

ON 





First, t e that if either hypothesis is true, then OT c rTBp The 
frequency distribution shows that OT occurs 6 tunes and is in fact the 
most frequent digraph in the message Moreover, by Rule I of subpara¬ 
graph b, if OT c =TBp then TO c =HTp Since HTp is a very rare digraph in 

normal plain text, T0 C should either not occur at all in so short a 
message or else it should be very Infrequent The frequency distribution 
shows that it does not occur Hence, there is nothing inconsistent with 
the supposition that the word in front of BATTALION in line F is THREE or 
THIRD, and there is some evidence that it is actually one or the other 

'“I v " ~ ~ 

(5) But can evidence he found for the support of one hypothesis 
against the other? Let the frequency distrloutIon he examined with a 
view to throwing light upon this point If the first hypothesis is true, 
then UZ c 3REp, and, by Rule I, ZU c =ERp The frequency distribution shotrs 

but one occurrence of UZ C and but two occurrences of ZU C These do not 

look very good for RE and ER On the other r bond, if the second hypothesis 
is true, then UZ c =IRp and by Rule I, ZU c =RIp The frequencies are much 

moie favorable in this case Is there'anything 3neonslatent with the 
assumption, on the basis of the second hypothesis, that TE c =ECp? The 
frequency distribution shows no inconsistency, for TE „ oc curs onceand 
ET c (rCEp, by Rule I) occurs once As regards whether FA c =EBp or rIBp, " 

both hypotheses are tenable, possibly the second hypothesis is ** ^nade_ 

better than tlie first, on the following reasoning By Rule 1^ if FA c =EBp 
then AF c =BEp, or if FA c sDBp then AF c =BDp The fact that no AF C occurs, 

whereas at least one BEp may be expected m this message, inclines one to 
the second hypothesis, since BDp is very rare 

(6) Let the 2nd hypothesis be assumed to be correct The additional 

values arc tentatively inserted in tlie text, and in lines G and K t”o 
inteiesting repetitions axe noted 1 

Line G Hi SM XC PT OT CX OT TC YA TD XH FA C-C YC PZ XII 

TA 1 i J T TH -S EC OH DB AT TA LI OH 

Lnne K MO IB XC PT OT C' GT 1H FY D T FG XI TC OL XU LT 

TA 'id / T Til 

RESTRICTED 

f'Ou 



REF ID:A 56892 





REOTrUCTED 


This cej to inly looks like STATE TIIAT THE . , , which -would make TEL-PT C 

Furthermore, in lane G the sequence STATETHATTI1E ,SICQIftlBATiALION c*n 
hardly be anyt hing else thap. STATS THAT TI^IR SECOND BATTALION, which 
would moke TC c =EIp and YA c =RSp Also &M C =-Sp 

(7) It is perhaps high time that the whole list of tentative equiva¬ 
lent values be studied in relation to their consistency witn the positions 
of letters in the Playfair square, moreover, by so doing, additional 
values may be obtained in the process The complete list of values is 
as follows 


Assumed values 

Derived by Rule I 

Wp-Mp 

LIp“PZ c 

gp=Mc 

IIp=ZP c 

ONplXHc 

NOprIKc 

ra p :0T c 

HTp=TO c 

IRplTJZft 

RIpzZUc 


BDp=AF c 

FC p =IE c 

CEp^ETc 

TEpiPTc 

m-Tic 

EIp=TCc 

XEp=CT c 

£§P=Bc 


-S p =SM c 

S-p=MS c 


(8) By Rule V, the equation THp=OT c means that H, 0, and T are all 
in the same row or column and in the absolute order HTO similarly, C, E, 
and T are in the same row or column and in the absolute order CET 
Further, E, P, and T are in the same row and column, and their absolute 
order is ETP That is, these sequences must occur someplace in the square, 
in either rows or columns, taking into consideration of course the proba¬ 
bility of cyclic displacements of these sequences within the square 

* _f - ^ * 4 m * 

l JX 

(a) H T 0 ^ -JJ») CET w ‘ (c)STP 

c 

(9) Noting the common letters E and T in the second and third se¬ 
quences, these two sequences may he combined into orie sequence of four 
letters, viz , C E T P Since only one position remains to be filled in 
this row (or column) of the square, and noting m the list of equivalents 
that EIp=TC c , it is obvious that the letter I belongs to the C E T P 
sequence, the complete sequence is therefore C E T P I 

(10) Since the sequence HTO has a common letter (T) with the sequence 
CETPI, it follows that if the HTO sequence occupies a ro j, then the 
CETPI sequence must occupy a column, or, if the HTO sequence occupies a 
column, then the CETPI sequence must occupy a row, and they may he com¬ 
bined by means of their common letter, T, viz 



or 


« C 


E 


I 
I 
I 

H T 0; 
P * 
I j 
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The pioof of whether the CETPI sequence, foi example, properly belongs as 
a row or a column of the Playfair square lies m the establishment of a 
rectangular relationship, instead of the linear relationships constructed 
thus far. 


_ ( ll ) Me note that, from the assumptions in subpar d(6), Afp=CX c and 

ONpCXH,, The relationship ONp=XH c might be either a rectangular one, such 
as 0 .X, or it might be linear, viz , HTOXN or H Since however 

T 

H N 0 

X 

N 


ATpZCX c must be a rectangular relationship, then only the configuration 


• c a ; 

- 

1 

Ic E 

H 

•» ■» 

| E ‘ 

T P 

l| 

|H T 0 x N| will be valid. 

since the alternative form 1 

0 


! P • 

iA 

X 


L I_J 

1 

1 

N 



will not 


satisfy the equation AT p sCX c 

(12) The fragmentary Playfair square^ has been established, in one 
of its 25 possible cyclic permutations, as 


C A 
E 

HTOXN 

P 

1 


Scanning the list of plain-cipher equivalents given in subpsr d(7) in 
order to insert possible additional letters, none is iound But seeing 
that several high-frequency letters have already been inserted in the 
matrix, perhaps reference to the ciyptogram itself m connection with 
values derived from these inserted letters may yield further clues For 
example, the vowels A, E, I, and 0 are all imposition, as are the very 
frequent consonants N and T The following combinations may be studied" 


ANpieXc 

Hp=eT c 

INp=6T c 

OIIpzXHc 


'ATprCXc 

I£p=Cjfc 
*OT p =XO c 

Jhf- 


Ui tp h XB q 

T'Jp3fr c 

;OprHX c 


TAp-^Cc 

TF-grPTo 

TIo-PCc 

TOprCXc 


In actual practice, it 3* more usual to start witr a much laigei 
diagram than a simple 5x5 square, as relationships develop, tne diagram 
is giadually condensed, until finally a 5^5 square ec-rge-s This pio- 
ccdure is quite similar to tnat employed in the reconstinotion diagrams 
for tv.o-square matrices 
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AT p (=CX c ), TAp(=XC c ), ON p (=XH c ), TE p (=PT c ) and p^T^) have already 

been inserted in the text Of the others, only 0X G (“TOp) occurs two 
times, and this value can be at once inserted in the text But can the 

equivalents of AN, EN, or IN be found from frequency considerations? _ 

Take EN p , for example, it is represented by ©T c What combination of 6T 
is most likely to represent EN p among the following cahdidates 

KT C (4 times), by Rule I, NE p would r TKc (no occurrences) 

VT C (5 times), by Rule I, NEp would = TV C (2 times) 

ZT C (3 times), by Rule I, NE p would = TZ C (l time) 

VT C certainly looks good it begins the message, suggesting the word 
ENEMY, and the sequence PZTV C , in line H, would become the plaintext 
sequence LINE. Let this be assumed to be correct, and let the word 
ENEMY also be assumed to be correct Then EMpZQE c and the partial squaie 
then becomes as shown herewith' 


P 

I 

C A 
V M E Q i 
N H T 0 Xl 


Figure 68a 

(13) In line E is seen the following sequence 

Line E . VT RK MW CF ZU BH TV YA BG IP RZ KP CQ FE LV 

EN RI NE RS PT -E 

The plaintext sequence RI..NERS..PT... suggests PRISONERS CAPTURED, 
as follows 

MW CF ZU BH TV YA BG IP RZ KP 
P RI SO NE RS CA PT UR ED 

This gives the following new values' ©P p =CF c , SO p =BH c , CAp=BG c , URp=RZ c , 
and EDp=I<P c The letters B and G can he placed in position in the 

partial square at once, since the positions of C and A are already known 
The insertion of the letter B immediately permits the placement of the 
letter S, from the equation SO p =BH c Of the remaining equations only 
ED p =KP c can be used Since E and P are fixed and are in the same column, 
D and K must be in the same column, and moreover the K must be in the 


RESTRICTED 


211 





REF ID:A 56892 


RESTRICTED 

same row as E There is only one possible position for K, viz , immedi¬ 
ately after Q This Automatically fixes the position of D The square 
is now as shown herewith. 


P D 
I 

G S C B A 
V M E Q K 
N H T 0 X 


Figure 68b 

(14) A review of all equations, including the very first ones es- 
tablished, gives the following which may now be used DBp3i’A c , RSp:YA c 
The first permits the immediate placement of F, the second, by elimi¬ 
nation of possible positions, permits the placement of both R and Y 
The sq j is now as shown herewith 


P F D 
Y I R 
G S C B A 
V M E Q K 
N H T 0 X 


Figure 68c 

Once more a review is made of all remainin g m used equations LXp=PZ c 
now permits the placement of L and Z IRpsUZ c now permits the placement 

of U, which is confirmed by the equation URp=RZ c from the word CAPTURED. 
There is then only one cell vacant, and it must be occupied by the only 
letter left unplaced, viz , W Thus the whole square has been recon¬ 
structed, and the message can now be deciphered. 


L(w)p 

F 

D 

z 

Y I 

U 

R 

G 

S C 

B 

A 

V 

M E 

Q 

K 

N 

H T 

0 

X 


Figure 68d 

f Reconstruction of the square on Playfair ciphers is normally 
carried on concurrently with the synthesis of the plain text, once a few 
correct assumptions have been made. Now, having just ieconstructed the 
equate as shown in Fig 68d, the question to be answered is whether this 
square is identical with the original enciphering rnatiix or whether it 
is a cyclic permutation of the original squaie (which nay have contained, 
say, a tianspos it ion-mixed sequence) Even though the cryptogram m 
subpar 7l£ has been solved, this point is still of interest 
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(1) Tho square that is derived may not necessarily be the original 
enciphering square, more than likely it wi31 be one of the 2h possible 


cyclic permutations of the original square If the Playfair square con- 
s1sted of a key*ford-mixed sequence, a permutation of the square will 
cause no difficulty in recovering the original matrix and hence the key 
word For example, if the square derived m some other instance is 
Q T L H 0 then the square P Y R A M is easily recovered because of the 


x z u v w 

A M P Y R 
B C I D S 
II K E F G 


I D S B C 
E F G H K 
L N 0 Q T 
U V W X Z 


tell-tale letters UVUXZ occurring in a row of the derivative square But 
when the Playfan square consists of a transposition-mixed sequence, then 
a different procedure must be adopted 


(2) As an example, let us take the transposition matrix 


5 8 6 1 ^ 3 2 7 from which A F 
PYRAMIDS W I 
BCEFGHKL GU 
I' n 'ITU7WX Z R 
l LX 


T D K is the original square 
H V M 
P B H 
E Q S 
Y C 0 


Using the 


methods illustrated in par. 51g, scanning successive rows of the square 
will disclose sequences of letters which could have appeared as columns 


in the transposition matrix 


For example, discovery of the columns I 

H 

V 


S 

L 

X 


will afford rapid recovery of the key word. But if instead of the original 
square we had one of its permutations such as Q S Z R E, then treatment 

C 0 L X Y 
D K A F T 
V M W I H 
BHGUP 


of the "columns", e.g , F 

V 

0 

L 

T 

M 

L 

X 

V 

W 

X 

Y 


of the tentative transposition matiix 


(assuming that some or all of the letters V, W, X, Y, Z are in the last 
row of the transposition matrix) will be without significance, therefore 
the procedure above is inapplicable without a slight modification. 

(3) Since it will he noted that a permutation of the rows will not 
affect the procedure of keyword recovery, then we construct a 9x5 rec¬ 
tangle QSZREQSZF which contains the five squares which result 
COLXYCOLX 
DKAFTDKAF 
VMWIHVMWI 
BNGUPBEGU 


simply from successive permutations of the columns. A 5x5 cut-out square 
will be found convenient in testing each permutation in turn Affirma¬ 
tive results will be obtained when the correct permutation is reached. 
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which in this case is the third square in the rectangle, namely, 

Z R E Q S. After recovery of the key word from this permuted “square it 

L X Y C 0 

AFIDK 

W I H V M 

GUPBN 



is probable then that the original enciphering square must have been 


A F T D K. 
W I H V M 
GUPBN 
Z R E Q S 
L X Y C 0 


(4) In the case of the square recovered in Fig 68d, it is found 
that, following the procedure outlined in subpars (l), (2), and (3) 
above he key word is based on COMPANY, recoverable from the following 
diagram* 


2 5 3 6 14 7 

COMPANY 
B D E F G H I 
K L Q R S T U 
V W X Z 


The original square must have been this 


A 

G 

S 

C 

B 

K 

V 

M 

E 

Q 

X 

N 

H 

T 

0 

D 

L 

W 

P 

F 

R 

Z 

Y 

I 

U 



Figure 60£ 

g Continued practice m the solution of Playfair ciphers will make 
the student quite expert in the matter and will enable him to solve 
shorter and shorter messages Also, with practice it will become a 
matter of indifference to him as to whether the letters are inserted in 
the square with any sort of regulai lty, such as simple keyword-mixed 
order, transposition-mixed ordei, or in a purely random order 

h It may perhaps seem to the student that the foregoing steps are 
somewhat too artificia3, a bit too "cut and dried" in their accuracy to 
portray the process of analysis as it is applied in practice For 
exaniple, the critical student may well object to some of the assumptions 
and the reasoning in subpar £(5), above, in which the words THREE and 


* The author once had a Student who "specialized" m Playfair ciphers 
and became so adept that he could solve messages containing as few as 
50-60 letters within 30 minutes 



RESTRICTED 


214 




REF ID:A56892 


t RBOTRIC ted , 

ONE (lot hypothesis) weic iejected in favor of the words THIRD and SECOND 
(2nd hypothesis) Thus looted 3aigely upon the reject ion of REp end ERp 
as blio equivalents of UZ C and ZU C , and life adoption of IRp and Rip as 
the u equivalents Indeed if the student wil3 oamme the fino3 message 
vitli a critical eye, he will find that while the hit of leasonjng m step 

(5) is perfectly logical, the assumption upon which It i3 based is m_ 

fact viong, for it happens that in this case ERp occurs only once and REp 
does not occur at all Consequently, although most of the reasoning 
which led to the 1 ejection of the first hypothesis and the adoption of 
the second was logical, it was in fact based upon erroneous assumption 
In other words, despite the iact that the assumption was incorrect, a 
correct deduction was made The student should take note that in crypt ¬ 
analysis situations oi this sort aie not at all unusual Indeed they are 
to be ejected, and a few words of explanation at this point may be use¬ 
ful 

jL Ciyptanalysis is a science in whJch deduction, based upon ob¬ 
servational. data, plays a veiy large role But 3t is also true that m 
this science most of the deductions usually rest upon assumptions It 
is most often the case that the cryptanalyst is forced to make his as¬ 
sumptions based upon a quite limited amount of text. It cannot be ex¬ 
pected that assumptions based upon statistical generalizations will 
always hold true when applied to data comparatively very much smaller in 
quantity than the total data used to derive the generalized rules 
Consequently, as regards assumptions made in specific messages, most of. 
the time they will be correct, but occasionally they will be incorrect ki¬ 
ln cryptanalysis it is often found that among the correct deductions 
there will be cases in which subsequently discovered facts do not bear 
out the assumptions on which the deduction was based Indeed, it is 
sometimes true that if the facts had been known before the deduction was 
made, this knowledge would have prevented making the correct deduction 
For example, suppose the cryptanalyst had somehow or other divined that 
the message under consideration contained no RE, only one ER, one IR, and 
two RI's (as is actually the case) He would certainly not have been 
able to choose between the words THREE and ONE (1st hypothesis) as 
against THIRD and SECOND (2d hypothesis) JBut because he assumes that 
there should be more ERp's and REp's than IRp's and Rip’s in the message, 

he deduces that UZ C cannot be REp, rejects the first hypothesis and takes 
the second It later turns out, after the problem has been solved, that 
the deduction was correct, although the ass ump tion on which it was based 
(expectation of more frequent appearance of REp and ERp) was, in fact, 
not true m this particular case The cryptanalyst can only hope that 
the number of times when his deductions are correct, even though based 
upon assumptions which later turn out to be erroneous, will abundantly 
exceed the number of times when his deductions are wrong, even though 
based upon assumptions which later prove to he correct If he is lucky, 


Ta 


See footnote 1 j on page 52 
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the making of an assumption which is really not true will make no dif¬ 
ference in the end and will not delay solution, hut if he is specially 
favored with luck, it may actually help him solve the message—as was the 
case in this particular example 

Another comment of a general nature may he made in connection 
with this specific example. The student may ask what would have heen the 
procedure in this case if the message had not contained such a tell-tale 
repetition as the word BATTALION, which formed the point of departure 
for the solution, or, as it is often said, permitted an "entering wedge" 
to he driven into the message The answer to his query is that if the 
word BATTALION had not heen repeated, there would prcibahly have heen some 
other repetition which would have permitted the same sort of attack * If 
the student is looking for cut and dried, straightforward, unvarying 
methods of attack, he should remember that cryptanalysis, while con¬ 
sidered a branch of mathematics hy some, is not a science which has many 
"general solutions" such as aie found and expected in mathematics proper 
It is inherent in the very nature of eryptanalytics that, as a rule, 
only general principles can he established, their practical appli¬ 
cation must take advantage of peculiarities and particular situations 
which are noted in specific messages This is especially true in a text 
on the subject The illustration of a general principle requires a 
specific example, and the latter must of necessity manifest character¬ 
istics which make it difierent from any other example The word BAT¬ 
TALION was not purposely repeated in this example in order to make the 
demonstration of solution easy, "it just happened that way" In another 
example, some other entering wedge would have heen found The student 
can he expected to learn only the general principles which will enable 
him to take advantage of the specific characteristics manifested in 
specific cases Here it is desnred to illustrate the general principles 
of solving Playfair ciphers and to point out the fact that entering 
wedges must and can he found. The specific nature of the entering wedge 
varies with specific examples 

72 Analysis of polygraphic systems involving large tables —a The 

analysis of systems incorporating large digraphic tables is accomplished 
hy entering, within the appropriate cells of a 26 x 26 chart, data corres¬ 
ponding to the plain-cipher lelationships of assumed cribs on 26 x 26 
chaits, and examining the charts for evidences of symmetry or systematic 
construction in their compilation The initial plaintext entries may, 
in the absence of cribs, he made on the basis of digraphic frequency 
considerations, aided hy ldiomorphisms and repetitions 

h. In pseudo-digraphic systems, such as those incorporating tables 
similar to Figs 47a and h, and 48, the identification of the monoalpha- 
betically-enciphered component of cipher digraphs will greatly accelerate 
plaintext entries, >mce advantage may be talien of this monoalphabet 1 - 
city. Tables with a feature of reciprocity, such as the example m 
Fig 50 , may be exploited on the basis of this weakness, even if the re¬ 
ciprocal pairs are assigned at random Tables such as that in Fig 49 
and the one for tnnome digraph]c encipherment shown m Fig 51 may also 
be exploited with facility, once enough plain text has been correctly 
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assumed and inserted to disclose their systematic construction A word 
of warning is inserted here against making incautious assumptions con¬ 
cerning the exact internal composition of tables suen as that in Fig 
since their unusual construction could easily mislead the analyst who 
Jumps to premature conclusions. In the case of a tabie such as Fig 53 
wherein the trinomes have been inscribed in straight horizontals (or for 
that matter, any other known inscription), if the dimensions of the table 
have been correctly assumed the simplest solution involves a reduction to 
two alphabets, leflecting the sequences of letters for the side and top 
of the matrix, this solution closely parallels that of the numerical 
four-square system described in subpar 6 $e 

c Because the foregoing principles are rather straightforward, it 
is not considered necessary to illustrate their application with examples 
Of course, when dngraphic tables of random construction have been used, 
no refinements in solution are possible However, the recording of as 
few as 225 different plaintext digraphs and thear ciphertext equivalents 
will theoretically enable the automatic decryption of approximately 92$ 
of the cipher digraphs of messages,and the recording of 335 plaintext- 
ciphertext vaDucs will enable the automatic decryption of 98$ of the 
cipher digraphs, thus almost every message may be read in its entirety 
without recourse to further assumptions Actually, it should be pointed 
out that having only 122 malched plaintext-ciphertext equivalencies will 
theoietically enable the decryption of 75 $ of the cipher digraphs, and 
enough skeletons of plain text may then be manifest to permit the decrypt¬ 
ion of the complete message texts 

d It might be veil to point out in connection with large di¬ 
graphic tables that there exist literal types which give rise to mono- 
alphabetic distributions for both the initial letters and final letters 
of pairs Such a table is illustrated in Fig 69 below 
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Figure 69 . 


In effect, encipherment by means of such a system yields the equivalent 
of a two-alphabet cipher, with a transposition within each of the pairs 
of letters. The cipher text produced by such a system may be character¬ 
ized by a large number of repetitions which begin with the initial letter 
of digraphs and end on the final letter of digraphs and which are pre¬ 
ceded by digraphs having repeated initial letters or which are followed 
by digraphs having repeated final letters; for example, ciphertext 
passages of the following type might often arise: SF BD GB HK and 
§9 BD GB WK (wherein the repeated plain text is actually represented by 
SDBBGK, affected by the transposition). This system is included here as 
being illustrative of many simple systems which are capable of leading 
the student very much astray in this Instance, if one were unaware of 
the transposition feature involved and were to attempt what appears to 
be the simple task of fitting plain text into the two monoalphabetic 
portions on the basis of single-letter frequency considerations, he could 
spend a great deal of time without success—probably without any idea of 
what was causing his difficulties. 

e. A. pseudo-trigraphic cipher involving a table such as that in 
Fig. 52 may be readLly recognized as such, since two letters of each tri¬ 
graph enciphered by means of such a table are treated monoalphabetically. 
If three separate uniliteral frequency distributions are made—one for 
each of the three letters of the cipher trigraphs—two of the distri¬ 
butions should be monoalphabetic. Then, exploiting the monoalphabeticity 
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(i.e., the positional monoolphsbeticity) thus disclosed In the cipher 
text, plain text can be fitted to the cipher on the basis of single- 
letter frequency considerations; in addition, advantage nay be taken of 
partial Idiamorphisms, if these ldiomorphlsms involve the particular 
positions of the trDgraphs vhich have been treated monoalphabetjcally. 

f. Fortunately, it is unlikely that trlgraphlc systems other than 
the foxegoing pseudo-trigraphic type will be encountered, because they 
are difficult to manipulate without extensive tables or complicated rules 
for encryption .^ 1 The subject can be passed over with the simple state¬ 
ment that their analysis requires much text to permit of solution by the 
frequency method,--and blood, sweat, and tears .^2 

73. Further remarks on polygraphic substitution systems , —a. In the 
treatment of the cryptography of the various digraphic systems Tn this 
Section, the rules for encryption and decryption which have been illus¬ 
trated are the "standard" nO°s (l.e., the rules extant in cryptologic 
literature, or the rules monl comnonly encountered in operational prac¬ 
tice). Needless to say, however, there is no cryptologic counterpart of 
the Geneva Convention making these rules sacrosanct, nor forbidding the 
use of other rules for enciphering and deciphering. 

b. In two-square systems and Playfair systems there are possible 
(and. In fact, there have been encountered In operational practice) modi¬ 
fications of the usual enciphering and deciphering rules which, if not 
suspected, may pose difficulties in the identification of such systems 
and in their cryptanalysis. For example, In a vertical two-square system, 
when two plaintext letters fall in the same column, their cipher equiva¬ 
lents might be taken as the letters immediately to the right of or im¬ 
mediately below these plaintext letters. Similarly, in a horizontal two- 
square system, If two plaintext letters are In the same row, their cipher 
equivalents might be taken as those immediately below, or to the right of 
these letters. In Playfair cipher systems, two plaintext letters in the 
same row might be represented by the letters immediately below, two 
plaintext letters In the same column might be represented by the letters 
Immediately to the right; a plaintext doublet might be represented by a 
ciphertext doublet foimed by doubling the letter immediately to the 
right, or below, or diagonally to the right and below, thus removing one 
of the identifying ciphertext characteristics of the normal Playfair 
system. In one case encountered, instead of tire normal Playfair linear 
relationship ABp=BC c , the rule was changed to ABp=GB c (thus allowing a 


a 

However, see in thiB connection Appendix 8, "Lester S. Hill alge¬ 
braic encipherment", which gives a mathematical treatment of true poly- 
graphic encipherment for polygraphs of any size. (See also subpar. 73h). 

k2 “ 

If a trigraphic system is encountered in operational cryptanalysis, 

special solutions would be made possible by the application of cribs, the 
aid furnished by isologs (not only in the same system, but also between 
systems), etc. 
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letter to ’’represent itself”—an "impossibility" In Playfair encipher¬ 
ment), even this simple modification caused difficulties in cryptanalysis 
because variant rules for encryption nad not been considered 

c The placing of cribs in small-matrix digraphic systems may be 
guided by the cryptographic peculiarities of these systems, when the 
general system is known to, or suspected by the cryptanalyst, conversely, 
the placing of a known crib may assist in the determination of the type 
of cryptosystem, or in the rejection of other types of systems For 
example, cribs may be placed in Playfair ciphers on the basis of the 
"non-crashing" feature of the normal Playfair, that is, on the basis 
that in the equation 1 2=3 neither 1 and 3 nor 2 and k can be identical 
In horizontal two-square systems, if = j*p, then «t(3 c must equal (3«tp, 

and if «*P C = (*__ p, then«*.p c must equal (J«tp If, by placing a known crib 

in a cryptogram, evidence of non-rcciprocity is disclosed (e g , if 
ABp=CD c , but CDp=XYc), the ciyptogram may be assumed to be other than a 
vertical two-square cipher, since vertical two-square encipherment yields 
complete leciprocity In eithei type of two-square system, if one of the 
two squares is known (for example, a vertical two-square might be employed 
in which the upper square is always a normal alphabet), the placement of 
cribs is materially facilitated 

d The $ test performed separately on the initial letter and final 
letter of ciphertext pairs from cryptograms produced by small-matrix di¬ 
graphic systems will give results neither close to that expected for 
plain text, nor close to that for landom text The reason for the com¬ 
parative "roughness" or pronounced differences among the relative fre¬ 
quencies in these distributions, as contrasted with the "smoothness" 
expected of random, is that small-matrix digraphic systems are only 
partially digraphic in natuie and that the encryption involves charactei- 
istics similar to those of monoalphabet ic substitution with variants 
This roughness of the uniliteral frequency distributions for the prefixes 
and suffixes, and, for that matter, for the over-all cipher text, re¬ 
flects the partially digraphic nature of the encipherment 

£ If the cipher letters V, W, X, Y, and Z are of very low fre¬ 
quency in the over-all uniliteial frequency distribution of a digraphic 
cryptogram or set of cryptograms, this may be taken as evidence that the 
cryptosystem is a small-matrix digraphic system employing keyword-mixed 
sequences an the matrix or matrices Furthermore, in small-matrix 
systems involving keyword-mixed squares, if 6^ of 08 c is one of the 

letters TADCYZ, the 6^ of the corresponding ©6p is likely to be one of 
these same letteis ^Similarly, if Is one of the letters VUXYZ, then 
6p of the corresponding 66p is likely to # be one of these letters 

f In trmome-digraphic systems employing large tables, the tri- 
nomes may run from 001 to 676, as in Fig 51, or any consecutive set of 
676 tunones in the scale of 1000 possible tnnomes may be used For 
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that mailer, the enlire spon or trinomes 000-999 Bright he mod an such a 
table, with occasional gnpo, to hide the limitation? of this system As 
another means of disguising tie limitation of 676 trmonies an such a 
system, three of the initial digits of the trmonies might have one \ari- 
ont each—thus no limitation would exist m the fust position of tn- 
nomes The 001, or other starting point in the cyclic scale, need not he 
at the upper left-hand corner of the table The Cj6 Li monies m such 
1ablcs may be inscribed in straight horizontals (i e., in the normal 
manner of wilting) es in Fig 51, or they might bo msoribco according 
to some other loute, they probably would not bo inscribed an a rardom 
manner because clumsy "deciphering tables’’would then be necessary It 
is also possible that the brinomcs in e tnnone-digraphic system might be 
converted into tetranomes by the addition of e sum-check (to assist m 
eiror-corroction) 

£ The cryptanalysis of tetranome-tngraphic systems with matiiccs 
similar to that illustrated m Fig 59 involves a modification of the 
technique used m solving inverse four-square systems If the plam- 
componcnt and ciph^i-component sections of the large square have been 
inscribed according to the normal manner of wilting (or any other manner, 
if known ), the first two elements of tne trigraphs may be reduced to a 
pair of cipher alphabets, and these two monoolphabetic substitutions may 
be solved as indicated in subpar 69e The applicability of inverse 
four-square solution principles to this tetranomc-trigraphic system of 
course rests on the fact that the ciphertext sections are known or as¬ 
sumed to contain the dinoncs 00-99 in numerical o’-dor, inscribed in the 
normal manner of writing, the conversion of the I'irst two elements of the 
trigraphs depends upon the knowledge of the monner of inscription of the 
letters of the plain component sections, in order that me four occur ¬ 
rences of the initial letteis and the four occurrences of the final 
letters may be conectly combined into two nonoalphabet ic distributions 
Of course, if the composition of the small square (for the third element 
of trigraphs) is known, the third letter of tngrapns may be automa¬ 
tically deciphered If the composition of the small square is not l rr, o m , 
a consideration of the frequencies of the converted dinomes for the moll 
square (1 e , the coordinates of the square to indicate the third ru Dor 
of trigraphs) may be used to obtain, on entering wedge into this thud 
monoalphabetic substitution 

h There are but a very limited nuraoer of known cipher mechanisms 
which employ the polygraphic cncipnenrent principle m any form U.S 
Patent Ho I51568O .issued to A Henkels m 1924 and U S Potent Ho 
18459*17 issued to T \ T eisner and Hill in 1932 describe two such mechanisms 
which produce poiygraphic sdbstitutio The latter, tnal of Veisner and 
Hill, is of porticular inteiest because it is based on a rather simple 
mathematical process which can yield true polygraphic encipnerment for 
polygraphs of any size The underlying mathematical process, j.r/entcd 
oy Pi of Lestei S Hill of Hunter College ard described In the ’Wcrican 
Mathematical Monthly" m 1929 (Vol XXXVI, p 306) and 1931 (Vol 
XXXVIII, p 135), is treated briefly, oelow, a more detailed xreatment 
is contained m Appendix 8, "Lester S Hill algebraic encipherment", 
winch also includes remarks on the cryptanalysis of tms method of cn- 
cipheiment 
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(1) Snnce Professor Hill's system is mathematica] in nature, the 
first step in its use involves the conversion of the plaintext letters 
into numbers by means of a conversion alphabet which shows a corres¬ 
pondence between the 26 letiers of the alphabet and the 26 numbers from 
0 to 25, such as the following* 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 
0 9 3 5 24 6 18 8 11 1 21 14 15 12 4 10 25 17 7 19 20 2 22 16 23 13 

(2) The numbers obtained through the conversion of the plaintext 
letters are next treated arithmetically through the application of alge¬ 
braic linear functsons, this treatment being performed by means of mod 26 
arithmetic The numerical results yielded by the algebraic treatment 
are then converted back into letters by means of the conversion alphabet*, 
to yield the cipher equivalent of the original plain text 

(3) For example, suppose that the message "NOTHING TO REPORT" is to 
be enciphered by trigraphs, and bhat, for this purpose, the enciphering 
keys^ are 1, 2, 1, 5, H, 3, 2, 4, 13 The message would be divided 
into trigraphs NOT-HIN-GTO-REP-ORT and the letters vhich result from the 
following operation would be taken as the cipher equivalent of the first 
trigraph. 

Using the conversion alphabet in (l), above, (NOT) is converted 
into (12 4 19), then the foregoing keys are applied— 

1* 12+ 2x4 + 1x19 = 12 + 8 + 19 = 13+1(26) = Z 

5*12 + 11* 4 + 3*19 = 8 + 18 + 5 = 5 + 1(26) = D 

2*12+4*4 + 13*19 = 24 + 16 + 13 = 1+2(26) = J 

Thus, NOTp is enciphered as ZDJ C 

(4) A large number of sets of enciphering and deciphering keys can 
be constructed It is even possible to construct keys which yield re¬ 
ciprocal encipherment, and it is this possibility which makes practicable 
the construction of a machine or device to accomplish the enciphering and 
deciphering 


T ‘ Vo' ' "" ^ ' 

2 Using "mod 26 arithmetic", one considers as the sum or product of 
two numbeis, the number from 0-25 which is obtained by subtracting 26 
(or a multiple of 26) from the ordinary arithmetical sum or product of 
the numbers 

44 

Encipherment of polygraphs containing n letters requues the use of 
n 2 keys Thus, 9 keys are necessaiy ior trigraphic encipherment, di- 
giapluc enclpheimenu requires only 4 keys, whereas tetragraphic and 
pentagraphic encipherment necessitate the use of 16 and 25 keys, respec¬ 
tively The numbers selected for use as keys mu%t be chosen according 
tq rathei definite lules based on tne "theory of determinants", other¬ 
wise, cryptographic ambiguity may result when decipherment is attempted 
Appendix 8 contains more on this matter 
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i. Attention is called here to the appllent Lons of Table 13 ("Four¬ 
square individual frequencies") of Appendix 2j this table has been re¬ 
produced here for convenience. If the cryptanalyst has at hand a fairly 

(Table 13, Appendix 2) 

(TUinmI on a count of 6,000 dlgrnplin] 
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large volume of cipher digraphs produced by encipherment with a normal 
four-square, he may use Table 13 as on aid in placing the initial letters 
and final letters of the cipher digraphs into the appropriate cell3 of the 
cipher component sections on the basis of their uniliteral frequencies . 

Thus, if a distribution made of the initial letters of cipher pair, m a 
particular example shows Qq, I c , and C c to be the letters of predominantly 
high frequency (listed in descending order of frequency), and if the 
distribution of the final letters shows F c , Qc, and P c as the letters of pre ¬ 
dominantly high frequency (in descending order of fr.quency), these 
letters may be tentatively placed into a skeleton four-square matrix as 
follows (Fig 70), based on the locations of the highest frequencies as 
given m Table 13 
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Figure 70 


In attempting to diagnose the underlying cryptosystem in any 
particular polygraphic cipher, the student may gain some assistance from 
the following recapitulation 

(1) In digraphic ciphers the majority of repetitions will "be an 
even number of letters apart and these repetitions should for the most 
part "begin on the first letters of pairs and end on the last letters of 
pairs The majority of repetitions m trigraphic ciphers will be some 
multiple of three letters apart and these repetitions should for the most 
part begin on the first letters of brigraphs and end on the last letters 
of trigraphs, 

(2) Digraphic cipheis may be revealed as such by the digraphic phi 
test, with additional support being given by the digraphic blank- 
expectation test, the presence of a null letter at the beginning of the 
cipher text might be disclosed by applying the two foregoing tests to a 
distribution of the digraphs which are formed when the first letter of 
the text is omitted 

(3) If either the uniliteral frequency distribution for the initial 
letters or for the final letters of the digraphs in a cryptogram exhibits 
monoalphabeticity, the cryptogram is probably a pseudo-digraphic cipher 
involving a large table of the type in Fig 47 or 48 If both of the 
foregoing uniliteral frequency distributions reflect monoalphabeticity, 
the cryptogram may involve the use of a table of the type in Fig 69 

(4) If the "decipherment” of a cryptogram by means of a four-square 
matrix containing four normal alphabets yields two'monoalphabetic substi¬ 
tutions—one for the initial letters and one for the final letters of the 
pseudo-decipherment--the cryptogram may be assumed to be an inverse four¬ 
square cipher. 

(5) If an ocular inspection or statistical evaluation of the cipher 
text of a cryptogram reveals a large number of "transparencies", the 
cryptogram probably involves a two-square system 
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(6) If a cryptogram contains several cnpher doublets, all of which 
are bioken up when the cipher text is divided into digraphs, the crypto¬ 
gram may well involve normal Playfajr encipherment 

(T) If the cipher text of a cryptogram exhibits any invariable af¬ 
finity of one of the letters J, K, Q, X, or Z for vowels (or, for that 
matter, another cluster of 5 or 6 letters), the cryptogram probably is 
in a small-matrix system employing sections consisting of moie than 25 
letters 

k If a particular four-square cryptogram involves the use of a 
matrix in which either the plain component sections or the cipher com¬ 
ponent sections are normal alphabets, the matrix will be recovered 
through cryptanalysis in its original form, even when the components 
which aie maxed have been derived by a transposition method or by no 
method at all In Playfair cipher solution, the matrix can be recovered 
m its original form as long as the original matrix has been mixed in 
some systematic manner However, in the case of two-square solution, 
there is no guarantee that the matrix can be recovered in its original 
form unless the original matrix has been keyword-mixed, if the original 
has been transposition-mixed, for example, the matrix which has been 
recovered through cryptanalysis—while being cryptographically equivalent 
to the original—will undoubtedly involve a permutation of the rows and 
columns of the original 

1 When four-square systems are encountered in which the matrix 
consists of four differently-mixed sections, reconstruction of the matrix 
is accomplished m a manner similar to that used in the analysis of two- 
square cipheis If the sections are composed of keyword-mixed sequences, 
the original matiix may be recovered Otherwise, the reconstructed 
matrix will in all probability be a permutation of both the rows and the 
columns of the original matrix, and there may be no way of lecovenng or 
or proving the original matrix 

m In passing, it might be well to mention that an/ two-square 
system can be solved as a four-square system in which tlif* matrix :sa 
composed of four mixed sections, upon the realization, fivora ptenemena to 
the matrix reconstruction, that a two-square matrix is involved,, the 
proper conversion ca Ti then easily be made 
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SECTION X 

CRYPTOSYSTEMS EMPLOYING IRREGULAR-LENGTH CIPHERTEXT UNITS 


Paragraph 

Preliminary observations .....74 

Types of alphabets with irregular-length ciphertext units... 75 

General remarks on analysis................... 7o 

Analysis of simple examples...*.... 77 

Analysis of more complicated examples. 7& 

Further remarks on cryptosystems employing Irregular-length cipher- 

text wits..... 79 


74. Preliminary observations .—a. The cipher alphabets of Marly all 
of the various cryptosystems treate&"*thus far in this text have involved 
cipher units of a constant length. 1 That is, the ciphertext units have 
been (prior to regrouping into fives for transmission) either single charac¬ 
ters, or pairs of characters, or three-character groupings, or, in the case 
of the Baconian and Baudot alphabets, 5-element ciphertext units; however, 
within a given cryptosystem the lengths of the ciphertext units have been 
consistent , and it is this consistency that has been of most importance to 
the cryptanalyst. 

b. There is no reason why a cryptographer could not vary the size of 
the cipher units in a particular cryptosystem, as long as no cryptographic 
ambiguity in deciphering would resui fc thereby. Furthermore, if the 
size of the cryptographic units is varied within a particular cryptosys¬ 
tem, obstacles are put in the way of cryptanalytic attack on the system- 
varying the length of the ciphertext groupings complicates the crypta¬ 
nalyst* s preliminary task of dividing the cipher text into the proper 
units for study. In this connection, the student should refer back to 
par. 63 and read again the remarks on the use of nulls which differ in 
size from the real cryptographic wits. The example contained therein 
makes it clear that, until such nulls are identified and isolated by the 
cryptanalyst, he Is unable to divide the cipher text properly and make 
appropriate frequency distributions. However, nulls may sometimes be rec¬ 
ognized as such because they do not behave like wits which represent 
actual plaintext elements. For example, in the three almost-identical 
ciphertext passages below, 

(a) ... 181^5 11343 71129 3219^ 23231 52937... 

(b) ...18151 01343 71129 32192 32^31 52937... 

(c) ...18151 13437 1^129 32192 30231 52937... 


The only exceptions have been in the digraphic systems using the 
matrices illustrated in Figs. 57a and 57b, in which a plaintext digraph 
may be represented by a ciphertext digraph, trigraph, or tetragraph, 
depending upon the identity of the plaintext digraph. 
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the "behavior of the digit 0 is characteristic of a mil, and when this is 
recognized and eliminated, the remaining cryptographic text may "be "broken 
up into its real units and solved quite readily. 

c. Since it has been indicated above that there are weaknesses in 
a scheme in which all cipher elements do not behave like equivalents for 
plaintext elements, it would be logical then to devise a system in which 
different-sized ciphertext units all represent actual plaintext elements 
and thus do behave more or less alike. It is easy to draw up cipher alph¬ 
abets in which, for example, some of the letters are represented by single 
digits, others by pairs of digits• Such a system, called a monome-dinome 
system2, would produce cipher text which is an n-regular intermixture of 
uniliteral and multilitcral equivalents. From the cryptanalytic standpoint, 
the decomposition of such cipher text could be very difficult for the an¬ 
alyst who does not know which digits to treat separately, which in pairs • 
Such systems, and similar variations, are given detailed treatment in the 
following paragraph. 

75. Typos of alphabets with irregular-length ciphertext units .—a. One 
simple scheme for yielding single-digit equivalents for some letters and 
two-digit equivalents for others makes use of a rectangular matrix which 
is similar to some of the biliteral matrices of Sections VII and VIII, 
but which differs in that the top row of the matrix has no indicator (or 
coordinate). For examples, see Figs* 71-74, below. Each plaintext char- 



Flgure 71. 


Figure 72. 


01 2 3^6 0 18 3 4 


- 

RATIONS 

- 

AIIOU 

7 

BCDEFGH 

5 

B C D F G 

8 

JKLMPQB 

2 

EKLMN 

9 

V W X Y Z . * 

9 

P Q R S T 



7 

V W X Y Z 


Figure 73 • Figure 7 1 * • 


acter appearing in the top row in the matrix has as its cipher equivalent 
merely the monome which appears above it, among the coIvji coordinates; 
thus, in Fig. 71, Ep B 0c, 5?p " lc, Wp s 8 C , etc. Each plaintext character 


^ See in this connection Foote, Alexander, Handbook for Spies , Hew York, 
19^9, pp. 250-256, wherein is described such a cryptosystem reputedly 
typical of those used by secret agents in World War II. 
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appearing In one of the remaining rows has as its cipher equivalent the 
dinome formed by its row coordinate and column coordinate, respectively? 
thus in Pig, 71 , Gp * 7 ^ c , Op * 6 l c , etc. 


b. It should he noted that the external construction of all of the 
foregoing matrices is such that any digit which appears as a row coordi¬ 
nate does not occur as the monome equivalent for any letter} this limita¬ 
tion, accomplished hy blanking out appropriate cells in the top row, is 
necessary in all monome-dinome systems in order that cryptographic ambi¬ 
guity will not arise. In Fig. 71, the internal composition is such that 
the plaintext letters which are most frequent in English are the ones 
which are provided with monome equivalents. This type of arrangement 
theoretically provides the most economical encryption for any given mes¬ 
sage—that is, theoretically yields the shortest possible cipher text for a 
given plain text—but, of course, greatly limits the number of internal 
arrangements which maybe used. Fig. 75, below, which Is split into two 
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5 2 9 7 6 
ABC D F| 
G H I J K 
L II P Q S 
U V W X Y 

Z . ( ) * 


0 1 8 3 4 
ETH80 


Figure 75a. 


Figure 75h. 


separate parts--one providing the monome equivalents and the other provi¬ 
ding the dinome equivalents—illustrates another scheme for drawing up a 
monome-dinome cipher alphabet. In this alphabet, the digits which are 
used for the initial and final elements of dinomes are completely distinct 
from the digits used as monomes. 


c. Most of the foregoing matrices contain a period for punctuation, 
and the matrix in Fig. 72, containing the single digits 0-9, provides a 
means for encrypting numbers without first spelling them out. The matri¬ 
ces in Figs. 71, 73, and 75 contain another character, symbolized by an 
asterisk, which maybe used for punctuation or as a special lndieator3. 
The matrix in Fig. 7^ uses only nine of the single digits as coordinates, 
the digit 6 being omitted; this single digit might be employed as a word 
separator, a stop, or a null. The matrix in Fig. 76 , below, illustrates 


9 

For example, this special character may be put to use as an indica¬ 
tor to show that plaintext numbers begin or end, thus obviating the 
necessity of Including digits within the cipher matrix. In this usage 
digits in the plain text might be tripled and inserted in the cipher text 
with the appropriate indicator before and after the plaintext digits. 
Thus, using the matrix in Fig. 71, the plaintext fragment"..HILL 865 .#" 
would be encrypted as the cipher sequence 75 2 77 77 66 888 666 555 66 
(prior to regrouping into five-character groups). 
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a scheme hy which certain high-frequency plaintext digraphs and trlgraphs 
may he represented in the matrix, as well as the single letters and 
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Figure 76 . 

digits. The symbol # in this latter matrix could he used as a Repeti¬ 
tion indicator” for checking numbers, as in the ciphertext passage 
69 65 68 76 69 65 68, meaning the number 752, the symbol * might he used 
as an indicator meaning "the immediately-preceding plaintext letter is 
repeated” (thus AA patterns would he suppressed in the cipher text). In 
all of the foregoing matrices the order of inscription of the letters 
within the matrix, and the particular arrangement of the row- and column- 
coordinates are "both subject to variation. 

d. By piearranged convention it is possible to employ ordinary commu¬ 
tative bipartite matrices (such as those already described in Sections VII 
and VIII) in a manner which yieJds monome-dinome encipherment. For example, 
using the matrix illustrated in Fig. 77, the plaintext word EIGHT could be 


1 

2 

3 

4 

5 

Figure 77. 

encrypted as 10 29 7 6 49. That is, the normal bipartite enciphering con¬ 
ventions would be used, with the exception that the row indicator in the 
cipher equivalent for a particular plaintext letter would not be employed 
when this row Indicator is the same as that for the immediately preceding 
letter of the plain text As may be noted, no cryptographic ambiguity in 
decipherment may arise. 

e_. Of course, as an extension of the foregoing ideas, there could 
also be monome-dinome-tri nome systems, incorporating matrices of the types 
Illustrated in Figs." 78-82, below. In Fig. 83 there is a matrix which may 


A variation of this method could make use of a convention by which 
the column indicator is dropped if it is the same as that for the pieceding 
plaintext letter. 
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Figure 79a. Figure 79b. 
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"be used for dinome-trinome encipherment. Encipherment with this latter 
matrix is commutative; for example, Ep « 24 or 42, and Tp * 621 or 162 . 

f. Literal versions of the preceding types of alphabets with Ir¬ 
regular-length cipher units are also possible. Several types are illus¬ 
trated in Figs. 84-88, including among them matrices permitting the use 
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of variants in encryption. Furthermore, any of the commutative variant 
matrices treated in par. $8a (i.e.. Figs. 27, 28, and 31) maybe used in 


CONFIDENTIAL 


231 




CONFIDENTIAL 


REF ID:A56892 


connection with the convention described in subpar. d, above, to provide 
cipher alphabets with irregular-length ciphertext units. 

76. general remarks on analysis .—a. The first step in the analysis 
of any cryptogram encrypted in a system with in egular-length cipher 
groupings involves dividing the cryptogram into the proper, vari-slzed 
cipher units—that is, reducing the cryptogram to monoalphabetic terms. 

After this has been done, solution proceeds along the straightforward 
lines which have been described in earlier sections of the text. Thus, 
in this section, attention will be focused on this first step of dividing 
the text into its pioper monoalphabetie units. In order to simplify some¬ 
what the general treatment contained in this paragraph, all remarks will 
he directed at monome-dinome systems, most of the principles and methods 
outlined herein are general enough that they may he modified end applied 
in the solution of other types of systems with irregular-length ciphertext 
units. 

b. A cryptographer, in his process of deciphering a particular 
monome-dinome cryptogram, would begin by considering whether or not the 
first digit of the cipher text were among those digits which can start 
a dinome—■that is, whether it were a row coordinate or not. If It were, 
he would treat it along with the next digit of the text as a dinome, 
and then proceed to consider whether or not the following digit were a 
row coordinate, etc. If the first digit of the message were not a row 
coordinate, he would treat it as a monome, and then proceed to consider 
whether or not the next digit were a row coordinate, etc. One may now 
see that the cryptographic process of dividing the cipher text into its 
propel units is based solely on a knowledge of the digits which are the 
row coordinates of the pertinent matrix. Thus, It may further he seen 
that the cryptanalytic attack on a monome-dirzoice cipher would first 
involve an attempt to determine the identity of the row coordinates. 

£. If a given cryptogram involves a matrix in which the high- 
frequency plaintext elements are evenly distributed throughout the various 
rows, it may be expected that the particular digits occurring with the 
greatest frequency in a uniliteral frequency distribution made on the cipher 
text are those which are row coordinates of the pertinent matrix. This 
may he explained by the fact that the digits used as row coordinates occur 
in the cipher equivalents for more plaintext letters than do those digits 
which are used as monomes. However, one must remember that a monome- 
dinome matrix may involve two, three, four, or more row coordinates and, 
although in a particular instance it may he that the most frequent cipher 
digits are those digits which have been used as row coordinates, a study 
of the uniliteral frequency distribution may not make ic cbvious as to 
just how many coordinates are involved, it may he necessary to make 
several trials, one considering only the two most frequent cipher digits 
as row coordinates, one considering the three most frequent, etc. 

d. If trials of the type just mentioned do not yield reduced, mono- 
alphabetic text which will succumb to the principles of plaintext recovery 
treated in the earlier sections of this text, it ray then he assumed that 
the cryptogram involves a matrix in which several of the high-frequency 
letters are arranged together in the top rcar or in which one or more columns 
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axe composed »o3ely of high-frequency letters. Such matrices are likely 
to produce cipher text in which some of the digits which have “been used 
as monomes occur more frequently than some of those used as row coordi¬ 
nates, Thus, the easy mode of entry via the uniliteral frequency dis¬ 
tribution may not be used, and other approaches of a less clear-cut 
nature must be taken. 

e. In an attempt to identify at least one or two probable coordi- 
nates7 the analyst should carefully scrutinize the cryptogram itself in 
order to find passages exhibiting bipartite characteristics , such as ap¬ 
pear in the sequence 80438 I 87 I 1 J, wherein the' digits 8 and 4 "act” like 
digits which have been used as row coordinates’, being spaced off at in¬ 
tervals of two, A slightly more objective approach Involves first making 
a biliteral? distribution of the cipher text, and then considering as a 
probable row coordinate the Initial digit of the particular dinome which 
the distribution shows to be the most frequent. Of course, this ap¬ 
proach is most likely to be valid when the particular dinome occurs with 
a much greater frequency than the remaining dlnomes. While still on the 
subject of distributions, it is pointed out that the previously-mentioned 
"bipartite characteristics" manifested in a cryptogram might be disclosed 
by making a b3litoral distribution of alternate digits of the cipher 
text®, that Is, in the sequence 123456"one would consider the dlnomes 13, 
24, 35, 46. In such a distribution, one may expect that the most frequent 
dinomes will be those comprising two digits which were both row coordinates 
of the pertinent enciphering matrix. 

f. If the cipher text of a given monome-dinome cryptogram begins 
with a doubled digit, this digit is most probably one of the row coordi¬ 
nates of the pertinent matrix; otherwise, the doublet would have to be 
considered as comprising two monomes and the first word of the underlying 
plain text would have to begin with a doublet (a very rare contingency 

in the English language). Similarly, if the cipher text is seen to contain 
any digit repeated consecutively four or more times, the particular digit 
may be assumed to be a row coordinate; otherwise, such a sequence of re¬ 
peated digits would have to represent at least a threefold repetition 
of some one plaintext letter (another rare event in English, although not 
as rare as that mentioned in the preceding sentence). 


5 The use of the term "b iliteral 11 in connection with digit cipher text 
may not be in conformance with the strictest rules of semantics, but the 
author feels that it is unnecessary to give a new name to an already- 
familiar type of distribution merely because it is being applied to a 
different kind of text. However, some who prefer to be purists in this 
matter term a digraphic distribution which is made on digit text as a 
"dinome distribution" or "dinomic distribution", and a biliteral distri¬ 
bution made on digit text, a "running dinome distribution". 

® In the vernacular such a distribution is termed an "A-A" (pronounced 
"ay-dit-ay") distribution. 
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g. On occasion it may be found that much time has been spent in the 
attempt to identify the row coordinates, yet apparently with nob all of 
the coordinates being identified. In such a case, it may be found useful 
to consider those digits which are least likely to be row coordinates, 
specifically, those which occur least frequently m the cryptogram. The 
analyst may go through the troublesome cryptogram and place a slant bar 
(virgule) directly alter each such digit as it occurs in the message. 
These marks may then be taken as an indication of places in the crypto¬ 
gram where one bona fide cipher unit ends and the next begins. The ana¬ 
lyst must then study the digits which directly follow these slant bars 
with a view to discovering new possibilities for row coordinates—possi¬ 
bilities which, although previously latent, have been made patent by this 
latest step. 

h. In the foregoing subparagraphs, a to g, the secondary step of 
testing tor the corroboration or invalidation of any particular trial 
decomposition has been passed over quite briefly. Actually, this step 
is best described with specific examples of solution, and for this 
reason is treated in two subsequent paragraphs, 77 and 'jQ, with such 
examples• However, a few methods which can be applied for the rejection 
of incorrect hypotheses will be mentioned here, because they are rather 
basic and simple. If the cryptanalyst finds, after having divided a 
monome-dinome cipher on the basis of a particular hypothesis, that a long 
repetition in the cryptogram is not broken up in the same way on each of 
its occurrences, he may well reject as incorrect the hypothesis on which 
the division is based. Likewise, the analyst may reject any hypothesis 
which requires him to make the last digit of a cryptogram a monome when 
this particular digit has to be considered as a row coordinate as part of 
the basic assumption .7 The presence of an inordinate number of consecu ¬ 
tive monomes may cause one to suspect that a particular decomposition is 
incorrect; however, probably only continued exposure to traffic of a 
certain type or involving one kind of enciphering matrix would provide 
pne with a sound basis for knowing Just how many are too many . 

i. There is one practical, straightforward measure for determining 
the relative goodness of an assumed decomposition which deserves particu¬ 
lar mention. It involves considering the ratio of the number of monomes 
produced in a particular decomposition to the number of remaining cipher 
units. In the case of monome-dinome ciphers, for example, In wnich an 
assumption of only two row coordinates is made, there can be no more than 
eight different plaintext letters represented by monomes and the total 
frequency of those monomes can not exceed the frequency expected of the 
eight most frequent letters in the language.8 Since in English the eight 
most frequent letters occur with a total relative frequency of 66$, any 
trial decomposition giving rise to a ratio of monomes to dinomes which is 


7 

However, the possibility of a final null or nulls must not be ig¬ 
nored; the presence of nulls at the end of the cipher text would invalidate 
this reasoning. 

o 

° The only exception to this statement would he a case wherein a word 
separator is included as part of the cryptosystem, and that this separator 
is renresented by a monome. This usage, however, seems rather unlikely. 
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considerably wore than 66 to 3** (* 1.9) may he considered incorrect. 
Likewise, since an assumption of three row coordinates limits to seven 
the number of different plaintext letters which may have monome equiva¬ 
lents and since the seven most frequent letters in English occur with a 
total relative frequency of 60$, any such assumption giving rise to a 
ratio of monomes to dinomes which is considerably more than 60 to 40 
(* 1.5) may he considered invalid. The author does, however, hasten to 
point out that a ratio which is smaller in any instance than the pertinent 
ratio, above, does not disprove the particular trial decomposition since 
the plaintext letters represented by monomes may not necessarily be the 
letters of highest frequency. The examples in the next two paragraphs 
will serve to clarify the foregoing considerations. 


T7» Analysis of simple examples .—a. The following cryptogram, 
suspected to be a monome-dinoiae cipher,“is available for study: 
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24090 15709 08121 02092 92405 56001 
27072 90482 47607 09022 10209 29724 
07292 93257 52961 09042 72002 07247 
50570 96081 72409 29040 40971 24097 
29128 76090 40750 65297 09067 20902 


P 


09040 74076 


Cursory examination of the cipher text reveals nothing more significant 
than the fact that the digit 3 is absent; however, the significance of 
this escapes us for the moment. A uniliteral frequency distribution of 
the text is then made, as is illustrated below: 


gg j2: 

% _ er §£ 

£ ^ 

£ S & 

!: 2? 2= S gi 

1234567890 

b. The uniliteral frequency distribution shows four marked peaks 
(2,7,9, and 0) and one pronounced trough (8). A biliteral frequency 
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distribution is made, as shown below, to assist in further evaluation of 
the properties of the cipher text. It is noted that the 2 and 0 rows. 


1 

2 

3 

4 

5 

6 

7 

8 

9 

0 


representing the two highest-frequency digits in the cipher text, have 
the most liberal combinations with the remaining digits; this would indi¬ 
cate that 2 and 0 are likely row coordinates of the cipher matrix. Since 
the 7 and 9 rows show less affinity of these digits for other digits, 7 
and 9 sure less likely to represent row coordinates of the matrix; conse¬ 
quently the assumption is made that the matrix involved only two numbered 
row coordinates, 2 and 0. 

c. The cryptogram is now divided accordingly, and the assumption of 
2 an&*”0 as row coordinates is borne out by the bipartite character of the 
following passages in the cipher text: 

(1) .../ 21 / 02 / 09 / 29 / 24 / 05 /... (at Al4) 

(2) *../07/09/02/21/02/09/29/... (at Bl4) 

( 3 ) ». ./09M/27/20/02/07/2V... (at Cl6) 

(4) .. ./ 2 V 09 / 29 MM/ 09 / • • • (at D12) 

A frequency distribution of the decomposed text is made, as illustrated 
below: 

123456789 0 
-|7 U " 1 5 b 12 1 lL 
22 -- 71-2192 
0 I 15-62272131 

The percentage of monomes, 35$, does not exceed the threshold for the 
sum of the frequencies of the eight highest-frequency plaintext letters; 
furthermore, since the eight monomes have a much lower frequency than the 
sum of the eight highest-frequency letters in English, this is an indi¬ 
cation that some of the monomes represent plaintext letters of lower 
frequency. 

d. The decomposed text may now be solved, and the message is found 
to begin with the words “SABOTAGE PLAITS..," The original matrix is recon¬ 
structed, and is discovered to be based upon the key word VERI10UTH, as 
follows: 


12345678 9 0 


-5--1-1- 

m* 

3 

21-71-21 

9 

6 

-1----21 

m* 

10 

-2--113- 

- 

2 

1-1-1- 

ft* 

4 

18-133-- 

- 

5 

21----1* 

ft* 

_ 

2 5 - - - 2 4 - 

_ 

10 

25-62272 

l4 

2 
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9 1 6 li 587302 
Vermouth| 

ABCDFGJJ K LI 
Iff Q S W X Y Z . J 

The reason fox- the absence of the digit 3 in the cipher text may now he 
seen: the digit 3 forms a part of only the letters H, J, and Z, and 
these letters did not occur In the plaintext message. 

e. Solution of certain other cases of mixed-length systems pro¬ 
gresses as easily as did the solution of the foregoing example. 

(1) For instance, in the case of a cryptogram produced hy a matrix 
with which the digits used for "both the initial and final digits of di- 
nomes are completely distinct from the monome digits (e.g., Fig. 75), It 
may he seen that ’'eliminating" from the cipher text those particular di¬ 
gits which were used as monomes in the original enciphering alpnahet will 
leave the remainder of the cryptogram broken up into units all of which 
contain an even number of digits. (This would not he true in the case of 
other types of matrices, such as Figs. 71 _ 7 ) +> since eliminating the digits 
which were used as monomes in the pertinent alphabet would remove not 
only actual cipher monomes hut also the final digits of many cipher di- 
nomes.) Baaed on this fact, if one is confronted with a cryptogram which 
he assumes to have been produced hy a matrix such as thai in Fig. 75; he 
may use a mechanical method by means of which he will quickly he able to 
determine which digits arc row coordinates and which are not, or, if his 
basic assumption concerning the type of matrix involved is incorrect, the 
error will quicxly become knovn 1o bun. He need only make successive 
trials each of which involves considering a diffeient one of the 10 di¬ 
gits as being one of those which is a rnonome in the pertinent alphabet, 
"eliminating" the particular digit from the cryptogram in each trial will 
inevitably lead to other digits which must also be eliminated throughout 
the cryptogram in order to maintain the stipulation that all the cipher 
units which remain must contain an even number of digits. For example, 

if one assumes that "0" is a digit which was a monome, then he must fur¬ 
ther assume from a sequence of cipher digits such as $50350 that "5” is 
also a digit which was a monome; and then likewise "3". Any particular 
one of the ten trials which is based on an lneomect Initial assumption 
may be expected to end up with all ten digits being considered as digits 
which were monomes. 

(2) Tn the case of a monome-dinome system in which the row coordi¬ 
nates of the enciphering matrix are distinct from the column coordinates 
(as in Figs. 73 and 7 ^); solution is expedited by capitalizing on the 
fact that the digits within the family comprising the row coordinates do 
not (end cannot) contact themselves or any other digits within the family; 
using Fig. 73 as an example, it is obvious that the digits 7; 8; and 9 
can never be followed by a 8, or 9* A cryptogram enciphered by such 

a system may be expected to contain much fewer cipher doublets than would 
a cryptogram produced by a matrix without the foregoing limitation, and 
the doublets which do occur will themselves involve but a limited number 
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of the 10 different digits. When solving such a cryptogram, the crypta¬ 
nalyst need only consider as possible candidates for row coordinates those 
particular digits which do not appeal in cipher doublets. Furthermore, 
he may with certainty go through the cryptogram placing a slant bar (to 
indicate the end of a valid cipher unit) after every occurrence of any 
digit which has appeared in a cipher doublet. 

(3) If the cryptanalyst is confronted with a monome-dinome cipher 
which is the result of encipherment by means of a commutative bipartite 
matrix (see subpar. 75d and the accompanying Fig. TJ), he knows that the 
first digit of the cryptogram must be a row coordinate. The analyst then 
has only to go through the cryptogram noting the digits which follow this 
row coordinate digit wherever it occxirs in the cryptogram and, in this way, 
he may be able to identify all uhe column coordinate digits. Of course, 
by the process of elimination, he will then know which digits are row 
coordinates besides the initial digit of the cryptogram, and it will then 
be possible for him to divide the text into its proper irregular-length 
ciphertext units. 


78 . Analysis of more complicated example s.—a. In some eases, the 
rather simple methods of analysis applied in the preceding paragraph will 
not boar fruit, either because of the complexity inherent m the number of 
plaintext elements In the cipher matrix, or because of certain unpredic¬ 
table abberrations caused by the particular designations of the row and 
column coordinates. For instance, if a specific matrix contained only 
the highest-frequency letters in the top row, and if the matrix contained 
a fairly large number of plaintext elements (and therefore embodied 3 or 
4, or more, row coordinates), and if the elements in the dinome rows were 
balanced from the frequency standpoint, so that the rows would be used 
with approximately equal frequency, and if furthermore certain of the 
columns were composed of heavier elements than others (thus producing 
peaks that might incorrectly he identified as row coordinates)—all these 
conditions would yield a cryptosystem that might pose considerable dif¬ 
ficulties in the way of straightforward analysis. A case will now he 
studied that will illustrate typical techniques that would he necessary 
in more difficult circumstances. 


b. The following cryptogram has been intercepted on an enemy net 
known"to be passing monome-dinome traffic: 


6 2 7 1 9 
0 9 6 3 7 
0 7 0 5 5 


4 4 0 8 1 
0 6 2 1 2 

5 8 7 1 9 


2 12 0 4 
2 4 7 1 2 
5 5 7 2 1 


7 12 7 0 
9 17 2 4 
0 4 10 9 


5 5 0 4 2 
21058 

5 2 8 4 7 


1 2 6 2 7 

12727 

7 12 9 7 


23571 82123 94578 77571 80581 97654 


7 4 5 7 2 

69051 

5 4 7 4 2 


0 5 19 1 
1 5 7 2 4 
78271 


7 7 19 4 
7 13 8 9 
7 2 3 2 7 


5 2 9 5 8 

4 7 3 1 6 

05504 


7 0 0 1 2 
7 9 0 3 5 
5 8 2 5 5 


1 2 2 5 1 

4 7 3 5 9 

5 5 9 1 8 
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The uniliteral frequency distribution for 


the cryptogram 1 b shown "below: 


%% u % 

% % 

gg ~g g s 

123 *i 56789$ 


c. From the appearance of the uniliteral frequency distribution, it 
is to""be expected that from among the four peaks (l, 2, 5, and 7 ) some row 
coordinates must "be represented, and since there is not much variance in 
frequency among these peaks, perhaps all four represent row coordinates* 

In an attempt to obtain as much Information as possible from a study of 
the frequency characteristics of the cipher text, a billteral distribution 
is made and is shown below. 


1 8 3 M 6 7 8 9 0 


1 

1 11 1-123353 

2 

7 233219132 

3 

1 1--3-111- 

4 

1 3*39-6521 

5 

3 2-39-6522 

6 

- 31-1-1-1- 

7 

10 722113217 

8 

3 3-1 — 3-11 

9 

3 --4412--2 

0 

1 -14-711121 



Examination of this latter distribution adds support to the impressions 
gained from the unilifceral frequency distribution, namely, that the row 
coordinates for the cipher matrix are very likely to he found among the 
digits 1, 2, 5, and 7« Furthermore the digit 7, because of its high fre¬ 
quency and because of satisfactory combinative qualities in the billteral 
distribution, is selected as a definite row coordinate—this will reduce 
the number of trials that must subsequently be considered. 

tr 

d. If all of the row coordinates of the cipher matrix are found among 
the various combinations of 7 with 1 , 2, and 5> then it is clear that: 

(1) if there are but two coordinates of the matrix, these must be 
either 7 and 1 , 7 and 2 , or 7 and 5 , (three cases): 

( 2 ) if there are three coordinates of the matrix, these must he 
either 7 - 1 - 2 , 7 - 1 - 5 , or 7 - 2-5 (three cases): or 

( 3 ) if the matrix has four numbered coordinates, this must entail 
the combination of 7 - 1 - 2-5 (only one case). 
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e. On the has is of each of the foregoing seven hypotheses, the cipher 
text Ts divided and the resulting frequency distributions are shown below: 


1 

J§L 


2 3 4 

.? ? 

J89 

jL 


17 1 usr 

5 l 4 

mwts 


- 

Q » P 9 

- 2 

212 

3 

2 

7 2 2 

- 1 

221 


7 




1234567890 

6 2 1-1 1 8 13 1 
66aa-i3ii3 


Casa XX 

1234567890 
nTT ---23132 

412121612- 
7 5 2 2 - 1 2 1 1 4 


Case 17 



Case V 


Case VI 



Case VII 

f. In order to be able to evaluate the relative merits of the seven 
hypotheses and choose the case which is most likely to be correct, it is 
possible to resort to a method wherein group frequencies of the high-fre¬ 
quency elements from each of the decompositions are studied. In the fol¬ 
lowing table drawn up for this purpose, the column of figures under "x" 
denotes the cumulative twelve bighest-frequency ciphertext omits; under 
M H", we have the actual frequencies of the first, the first two, the first 
three..., the first 12 highest-frequency ciphertext omits for each hypo¬ 
thesis (compare with the distributions In subpar. e)j in the adjoining 
column to the right of each "N" column, the various cumulative frequency 
values are expressed as percentages of the total number of ciphertext 
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I 

II 

in 

IV 

V 

VI 

VII 

P 

X 

H 

V 

158 

II 

II 

m 

H 

H 

157 

ll 

II 

iff 

3 

N 

138 


H 

139 

H 

N 

129 


hr 

31 

19 & 

30 

18 .£ 

25 

15.9 

29 

19.7 

l6 

11.6 

17 

12.2 

12 

9.3 

i 3 .o 

2 

48 

30.4 

48 

29.8 

46 

29.3 

44 

29.9 

29 

21.0 

31 

22.3 

24 

18.6 

22.2 

3 

64 

45.0 

64 

39.8 

60 

38.2 

57 

38.8 

42 

30.4 

43 

30.9 

34 

26.6 

30.2 

4 

77 

48.7 

79 

49.1 

73 

46.5 

67 

45.6 

51 

37.0 

54 

38.8 

4i 

31.8 

37.8 

5 

87 

55.1 

91 

56.5 

85 

54.1 

77 

52.4 

60 

43.5 

62 

44.6 

47 

36.4 

45.3 

6 

96 

60.8 

102 

63.4 

92 

58.6 

84 

57.1 

67 

48.6 

70 

50.4 

52 

40.3 

52.7 

T 

105 

66.5 

110 

68.3 

99 

63.1 

90 

61.2 

74 

53.6 

76 

54.7 

57 

44.2 

60.1 

8 

113 

71.5 

116 

72.0 

105 

66.9 

96 

65.3 

80 

58.0 

81 

58.3 

62 

48.1 

66.2 

9 

120 

75.9 

122 

75.8 

111 

70.7 

101 

68.7 

86 

62.3 

86 

61.9 

67 

51.9 

70.4 

10 

127 

80.4 

128 

79.5 

117 

74.5 

105 

71.4 

91 

65.9 

91 

65.5 

71 

55.0 

74.0 

ll 

133 

84.2 

133 

82.6 

122 

77.7 

109 

74.1 

96 

69.6 

96 

69.I 

75 

58.1 

77.f 

12 

137 

86.7 

138 

85.7 

127 

80.9 

113 

76.9 

101 

73.2 

101 

72.7 

79 

61.2 

80.5 


units which remain after the particular trial decomposition. The column 
labelled "P w gives the emulative theoretical frequencies of the first 
12 letters in English plain text. 

g, It is noted that in Case I, the most frequent ciphertext unit 
has a percentile frequency of 19 . 6 $, the highest two units, a percentile 
frequency of 30.4$; the highest three, a percentile frequency of 45.0$. 
When these percentages are compared with the percentile frequency of the 
highest-frequency letter in English plain text (13.0$), of the highest 
two letters (22.2$), and of the highest three letters ( 30 . 2 $), it is clear 
that Case I does not conform to the characteristics expected of a simple 
monoalphabetic substitution, therefore Case I Is not the correct division 
of the cipher text. Similarly, Cases II, III, and IV can also be rejec¬ 
ted because the cumulative values are much higher than the corresponding 
expectations for plain text. Case VII, on the other hand, demonstrates 
values much lower than the corresponding expectations for plain text; 
therefore this case too is rejected. This leaves only Cases V and VI, 
both of which show a close correspondence with plaintext expectations* 

h. If there were nothing else in the manifestations of the decom¬ 
posed - *cipher text in Case V and Case VI, these two cases would have to 
be tried in turn, making some tentative plaintext assumptions) of course, 
only the correct case would consistently yield plain text. However, there 
is an additional bit of reasoning which may be applied here as a means of 
deciding which of these two remaining cases is more likely to be correct 
and ought to be worked on first—namely, it may be reasoned that cipher 
text which has been decomposed according to an incorrect hypothesis will 
be likely to contain a larger ratio of monomes to dinomes than would the 
same text if it had been decomposed according to the correct hypothesis.9 


9 This intuitive reasoning has been borne out by empirical observation. 
30 monome-dinome ciphers of an average length of 100 digits were decomposed 
in all possible ways based on the hypotheses of two, three, and four row 

coordinates. In the case of _ of these ciphers, the correct decomposition 

yielded a monome-to-dinome ratio which was lower than the monome-to-dinome 
ratio yielded by any of the incorrect decompositions. 
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Case V has a monome-dlnome ratio of .916 whereas Case VI has a correspon¬ 
ding ratio of 1.043; thus Case V is indicated as the case which is more 
likely to he correct. 

i.. The cipher text is now divided according to the hypothesis of 
row coordinates of 1, 5, and 7, and the plain text is quickly recovered, 
facilitated hy the pattern of the first word, RECONNAISSANCE. The cipher 
matrix is reconstructed as follows: 



The reason for the high frequency of the cipher digit 2 is now seen: the 
combined frequencies of Ep, S p , and U p contribute to an inordinate peak 
for that column coordinate. 

In retrospect, several important points may he noted In the so¬ 
lution of this particular cryptogram. First of all, the four consecutive 
5 ’s in the last two groups of the cryptogram make it a very strong proba¬ 
bility that 5 is a row coordinate, otherwise the four 5’s would mean a 
threefold (or even fourfold) repetition of a monome letter, a compara¬ 
tively rare contingency. Secondly, the digit 1 could have been selected 
as a row coordinate with considerable certainty, based on the fact that, 
since the dincme 12 was the highest-frequency element in the bi literal 
distribution, it may be assumed that at least a number of 12's were causal 
and therefore 1 must be a row coordinate. In other words, the correct 
set of coordinates might have been established at the very beginning of 
the analysis, but for pedagogical reasons it was felt necessary to pro¬ 
ceed along the general lines of the solution as given. It is to be noted 
that, since at the start of solution we did not know exactly how many 
numbered row coordinates there were in this particular case, we could not 
apply the ratio of monomes to dinomes at once as the deciding criterion. 

k. If mixed-length systems were encountered in actual practice, 
after”"the type of matrix became known through solution of several days * 
traffic, solution of subsequent days* messages would be facilitated be¬ 
cause by this time the analyst would be familiar with the general type 
of matrix used. This knowledge would be of great assistance in making 
assumptions as to the nature of subsequent matrices. In some cases, the 
internal arrangement of the matrix might remain fixed, with only the co¬ 
ordinates being changed periodically, in other cases, the internal ar¬ 
rangement and the coordinates of the matrix might change, with only the 
size of the matrix remaining fixed. If it were known, for instance, that 
tii"e enemy were using a monome-dinome system with only two numbered row co¬ 
ordinates, then there would only be 10 x 9 or 45 exhaustive trials (if 

1x2 

these had to be made) which would be necessary to guarantee reaching the 
correct decomposition of the cipher text; if there were three numbered 
coordinates, then there would be a maximum of 10 x 9 x 8 or 120 trials 

1x2x3 

necessary to insure reaching the proper scheme for the decomposition of 
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the cipher text.10 such trials, although laborious (and ordinarily un¬ 
necessary) when made “by manual methods, would “be by no means prohibitive 
if there were available machine processes for assistance. Exhaustive 
trials would rarely be necessary, except in very difficult cases; in the 
majority of instances, straightforward methods of cryptanalysis would 
reduce the large number of theoretical trials to but a few, from which 
the correct selection could be made. 

l. If the exact composition of the internal arrangement of the 
matrix were known, this knowledge would be useful in determining how the 
letters of assumed cribs would be enciphered as xronoiros or dlnomes. In 
any case, if a word of pronounced idiomorphic pattern is assumed, no mat¬ 
ter bow the letters of the word are encrypted as monomos or dinomes, the 
idiomorphism must be patent in the cipher text; for example, the word 
ARTILLERY in a monome-dinoroe system must have a consecutively repeated 
monome or dinone representing Lp, closely flanked on both sides by some 
particular monome or dinoire representing Rp. If unoneiphered numbers 
were to appear in the encrypted text, bracketed by an indicator to signal 
that numbers begin and end, the recognition of these plaintext numbers 
would enable the analyst to identify the indicator, and thus, lead to 
the establishment of one row coordinate. 

m. It must be pointed out that mixed-length oystai3, even more so 
than other types of systems treated in this text, often present unusual 
problems for the cryptanalyst. Each case is a distinctly special case,^- 
but continued practice in the solution of these types of systems should, 
as in other situations, cultivate skill and develop abilities in this 
field. 

n. The student may have noted that no mention has been made con¬ 
cerning the possible use of the (j> tost as a means for determining whether 
or not a particular trial decomposition represents tl e proper reduction 
of a cryptogram to mono alphabet le term's. The <|> test has been ignored 
throughout this Section because, when dealing with cipher alpinists which 
include plaintext elements other than single lettois (e.g., such elements 
as syllables, numbers, indicators, etc.), the value of <{p can only he 
loosely approximated, furthonxxre, computation of the valve of <f> r in a 
mixed-length cipher is also a rather tenuous matter. For this reason, 

it has been considered best to describe only relhodo of solution which 
do not depend at all on the use of the i test, and thus keep from estab¬ 
lishing in the mind of the student any doubt as to the usefulness of this 
test when applied in other instances, such as those described in earlier 
sections of this text. 


10 The number of combinations of H things taken £ at a time is given 
by the formula jjC r - HI ; thus for the assumption of 3 numbered 

r!(N-r)I 

rows in a monome-dinome matrix, mCo s 10*9 876 5*^ 3*2*1 » 10*9*8 = 120 . 

3*2*1 (7 6*5 4*3*2*1) " - 372.1 
The notation HI is read as "factojial N." 

^ And, as one cryptowag has pointed out, some cases are more special 
than others. 
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79, Further remarks on cryptosystems employing Irregular-length 
ciphertext units ,—a. The subject of the diagnosis or identification of 
mixed-length cipher systems has not been discussed. This problem can 
sometimes be extremely difficult In complex cases; however, the general 
statement can be made that one takes advantage of any phenomena of repe¬ 
titions that axe present in a cryptogram to arrive at the conclusion that 
a mixed-length system has been encountered. If the repetitions present 
are separated by numbers of letters without a constant factor, or if the 
interval between repetitions is a prime number, and if the possibility 
of a null or nulls (of a different size than the real cryptographic units) 
has been considered and ruled out, then in all probability the cryptogram 
involves some sort of mixed-length cipher units. As to exactly which 
kind of mixed-length system is involved, this question can be answered 
only by detailed analysis, sometimes to the point of actual plaintext 
recoveries in order to be certain about one’s conclusions, 12 

b. It is not Imperative that a mixed-length cipher system be pro¬ 
duced - "through the medium of a matrix with row and column coordinates. 

For example, in one cryptogram that was submitted for solution, the cipher 
text began as follows: 


Q K T 2 Q 
K K T 2 K 


3 K B 3 K 
K T 2 K B 


QKTQK 
3 Q K T Q 


T 3 Q K T 
BQEK3 


2KB3Q 

KQ2QK 


K T Q R 2 


T 2 Q R 2..., 


The entire cryptogram, containing k$0 characters, consisted only of the 
seven symbols B, K, Q, R, T, 2, and 3 • When this cryptogram was solved, 
the following alphabet was recovered: 


A « K3 
B « KR3 
C « QB3 
D = KB2 
E * KB3 
F = KKT2 


G = KR2 

H * Q3 
IJ = QKT3 
KQ = K2 
L = KKT3 
M = QR3 


N a Q2 
0 » QR2 
P = OR 
R = QKT 
S = QB 
T = QKT2 


U = Q 
V = QB2 
W = K 
X = KB 

y - KET 
Z o KR 


To the reader who is a devotee of the royal game, it will be apparent 
that the foregoing alphabet is based upon chess notation.^ If however 
the digits 1-7 had been used in lieu of the symbols above, the cryptogram 
could still have been correctly divided into its component ciphertext 
groupings of 1, 2, 3, and 4 digits, based upon an interpretation of the 
characteristics present in the cipher text, and of the phenomena in a 
triliteral distribution showing one prefix and one suffix, 11 *’ 


12 Cf, the discussion of diagnosis in subpar. 69f» 

1 3 The chess-playing reader nngnt be interested m recovering the key 
word for this alphabet. 

ll+ The interested student could make up a cryptogram using seven 
characters in this fashion, so m coold see for himself the methods of 
attack on such a system. 
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c, The concept of irregular-length cryptographic units can he 
applied to many varieties of systems, both code and cipher* For example, 
in Fig* 89 , "below, there is illustrated a four-square matrix in which 
plaintext digraphs are represented by ciphertext dinomes, trinomes, or 
tetranomes. The positioning of the monomes in the ciphertext portions of 
the matrix was governed by the frequencies of individual components of 


A 

B 

c 

D 

E 

10 

12 

5 

3 

13 

F 

G 

H 

I 

K 

i4 

15 

16 

8 

17 

L 

M 

N 

0 

P 

18 

19 

4o 

6 

0 

Q 

R 

S 

T 

U 

42 

43 

9 

2 

7 

V 

W 

X 

Y 

Z 
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E 
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2 
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M 

N 
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U 
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W 
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Y 
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Figure 89 

four-square cipher digraphs,*5 thus permitting optimum compression of 
the cipher text, i.e*, allowing the most liberal vise of ciphertext dinomes 
and trinomes rather than the maximum cipher length of tetranomes; for 
example, the word REGIMENTAL would be encrypted RE GI ME NT AL. 

76 8l4 06 68 1018 

d. The matrix for another mixed-length cipher system, employing 
dinomes and trinomes for the encryption of plaintext digraphs, is shewn 


15 


See Appendix 2, Table 13, 


"Four-square individual frequencies." 
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in Figs* 90a and b. Using this matrix, the word DIVISION is encrypted 
as 07 883 32 7 k6.~ It is noted that consonant-vowel digraphs involving 
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eight high-frequency consonants with five vowels are represented "by di- 
nomes, and all other plaintext digraphs are represented hy trinomes. In 
those rare cases where, as in the example HD ZZ LB, an ''impossible" di¬ 
graph appears in the plain text, the insertion of the letter Kp in the 
plain text at that point m question, similar to the normal Playfair doub¬ 
let convention, enaoles the encryption of the word, as MU ZK ZL E* A 
better variation of the foregoing system might incorporate a dinone matrix 
for the 1*0 highest-frequency digraphs (comprising 1*2$ of English plain 
text) such as that illustrated in Fig. 91, and a trinome matrix modified 
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0 1 2 3 i> 6-.1-8,-_9 

AIT AR AS AT CO DE EA ED EE EN 
ER ES ET FI FO HI HT 10 IS LE 
MA HD HE HT OH OR OU RA RE RT 
SE SI ST IE TH TI TO TO TY VE 


Figure 91* 
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in suitable fashion for the remaining digraphs. Such a scheme would 
yield a greater condensing property for the cipher text, "but would not 
he as easy to use as the system described above since the easy mnemonic 
feature of the matrix in Fig. 90b would he lost. 

e. Another idea for a cryptosystem having irregular-length cipher- 
text groupings employs the diagram in Figs. 92e and h. This scheme in¬ 
corporates Playfair digraphic encipherment (with biliteral cipher equiva- 
• lents) and monographic encipherment (with uniliteral cipher equivalents). 
In order to disturb the regularity of us vial digraphic encipherment (pro¬ 
duced by the Playfair-type matrix 3n Fig. 92a), certain selected medium- 


B G M V ¥ 
W V M G B 


Figure 92a Figure 92b 

frequency consonants are enciphered monographically and uniliterally by 
the reciprocal alphabet shown in Fig. 92b. Using Fig. 92, as an example, 
the phrase BRIGADE OF EHEMY IEFAETRY MOVING.. .woul d be broken up and 
enciphered as follows: 

B RI G AD EO FE EE M YI EF AE TR YX M OX V IN G 
¥ PL V CE AR AF IF M UL SN FH YO ZY M QT G EH V 

The cipher text, regrouped into fives, 1 UPLVC EARAF LFMUL SEFHY OZYMQ TGKEV, 
reveals no indication of the uniliteral-biliteral encipherment involved. 
Since the letters BGMVW represent 8.2$ of normal plain text, there is ap¬ 
proximately 8$ interruption of the regularity of normal digraphic text. 
Furthermore, since it is expected that about half the time these letters 
will occur as singles in the plain text, and about half the time as In- 
terruptor letter (such as Xp in the example above) will have to he used, 
this scheme is accomplished by adding only about 4$ to the length of the 
original plain text. Other variations of the basic Idea are found m 
Figs. 93 and 94; in Fig. 93, the Playfair matrix is a 6 x 4 rectangle 
omitting S and Y, and these two letters form a reciprocal monographic 
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Figure 93 Figure 94 


encipherment convention; in Fig. 94, the Playfair matrix is the normal 
5x5, but with the convention that, unless Ep is the second member of a 
digraph in the process of encryption, Ep is represented monographically 
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"by J c . In the foregoing two figures, the SY of Fig. 93 could he replaced 
of course hy any other two letters whose combined frequency is in the 
neighborhood of 6-10$, and the monographic Ep of Fig. 94 could he replaced 
hy any other high-or medium-frequency letter. Instead of Playfair 
matrices, the digraphic portions of the enciphering schemes of this sub¬ 
paragraph could he accomplished hy the use of any other small-matrix 
digraphic methods. 


f. The Morse code, consisting as it does of irregular-length units 
composed of dots and dashes, lends itself to interesting cryptographic 
treatments. For example, the dots and dashes (and, of necessity, the 
spaces between Morse characters) might he encrypted hy means of the table 
illustrated in Fig. 95, wherein each of the three elements has approxi¬ 
mately the same number of variants. A better idea, however, is to employ 
variants in the proportions of dots (42.4$), dashes (29.1$), and spaces 


dot: 

dash: 

space: 


ABC DEFGHI 
JKLMNOPQR 
STUVWXYZ 


dot: HYDRAULICBE 

dash: FGJKMNOP 

space. Q S T V W X Z 


Figure 95 


Figure $6 


(28,4$) of the letters comprising normal English plain text; such a scheme 
for variants is shown in Fig. 96 . Thus, using the example of Fig. 96 , 
the word EIJE1IY (which in Morse code is • -• — ) might he en 

crypted as RS MDW CQ EFV PDCGZ, which would then he regrouped in fives 
for transmission. Other ideas for the encryption in digit form of Morse 
code systems might incorporate alphabets such as those illustrated in 
Figs* 97 and 98 below: 


dot: 12 3 4 

dash: 5^7 

space: 8 9 0 

Figure 97 


dot: 13579 

dash: 2 4 6 8 

space: 0 

Figure 98 


g. Space does not permit detailed examples of analysis of some of 
the foregoing systems. Admittedly, some of them would pose considerable 
difficulty in the way of solution, however, if these systems were used 
in actual practice, then operational cryptanalytic methods and entries 
would make possible successful solution. 

1 Further ideas of cryptosystems based on the Morse code will he 
treated in Military Cryptanalysis, Part IV. 
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SECTION XI 

niSCELLAUEOUS MOTOALPHABjETIC SXBEEMSji COrCLUDEIO BEMARKS 

J ' *" " ’ ’ Paragraph 

CryptosY^tems employing oyllabcuy squares and code charts.. 80 

Cryptoc^tcnis employing characters other than letters or figures. ... 8l 
Special remarks concerning the Initial classIfication of cryptograms.. 82 


DiPgm red secret conmamicat Lons. .. 83 

Concluding remarks..... 84 


P o. C ry ptosystems employing syllabary squares and code charts .—- 

a. Tne various cryptoSyrterns treated in the preceding sections of 

this text have m uho main fallen Into cither the uiultllltcral category 
or the polygrophlc category This and the next fev subparagraphs will 
xi cat of systems which represent a merger of these two categories— 
mnelv, bilitcrol systems which have as plaintext elements not only 
single letters end digits, but also certain polygraphs selected for the 
condensation in cipher text that their usage may permit In addition, 
treatment will be made of bllitcral systems whicn involve, as plaintext 
ami/3, a selection of frequent words (that Is, which occur frequently 
j.n the typo of traffic for waloh the particular cryptosystem is intended) 
and perhaps some common phrases, such as "reference your message number", 
"request acknowledgment", "nothing to report", etc. Systems which embrace 
digraphs, trigraphs and other polygraphs as plaintext elements In addition 
to single letters and digits are called syllabary systems because the ad¬ 
ditional Inclusion of these polygraphs permits the encryption of plain 
text in a syllabic or quasi-syllabic fashion, most systems of this type 
involve bipartite matrices in the cryptographic scheme, and these matrices 
sire called syllabary squares . When the matrix in this general type of 
system also incorporates words among the plaintext elements, the matrix is 
termed a code chart . r ~ . ’ 

l k 

b. The category of systems embodying syllabary squares and code 
charts as oho cryptographic vehicle actually constitutes a transition 
between cipher and code systems,^ since a syllabary square or a code 
cnaru may bo regarded equally properly as either a special type Of ci¬ 
pher or a primitive code. However, because syllabary systems follow 
very clo&cly on the Ideas of bipartite matrices, these systems are in¬ 
cluded in this particular text instead of being reserved for treatment 
in a subsequent text. 


■** See definitions of the terms cipher and code in the glossary. 
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c A sample syllabary square Is illustrated xn Fig 99, below: 


1 

2 

3 

k 

5 

6 

7 

8 

9 

0 


It will be noted that the square contains the 26 letters, the 10 digits, 
and 6h digraphs and trigraphs chosen both on the basis of frequency con¬ 
siderations and the combinative potentialities of the particular poly¬ 
graphs ; the internal arrangement of the square xs such as to permit the 
easy finding of the plaintext elements to be enciphered. Other matrices, 
of larger dimensions, may contain not only a larger number of different 
plaintext elements within the matrix, but may also duplicate some of the 
more frequent plaintext elements and thus incorporate plaintext variants 
within the matrix. Furthermore, when letters are used as coordinates, 
variant cipher equivalents may be incorporated into the scheme 

d. Typical of the many ideas that have been employed in the past 
for code charts is the chart which is shown in Fig. 100, below, and which 
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has "been used as a standard tactical cryptosystem for ground forces by 
AGCRESSOR, tbe maneuver enemy in U.S. joint roaneuveia and training ex¬ 
ercises. This chart provides 2-letter equivalents for letters, numbers, 
syllables, and a selection of words which occur frequently in low-echelon 2 
messages. A particular plaintext value may be designated by a combination 
of one of the two row coordinates and one of tbe two column coordinates 
of the cell containing the plaintext value; thus each plaintext element 
has four variant equivalents and, for example, the word ARTILLERY con¬ 
tained in the chart my be encrypted in toto as TF, TI, QF, or QI 
When a complete word contained in the chart is to be encrypted in a mes¬ 
sage, no designator is necessary to indicate this lowercase meaning How¬ 
ever, when upper-case meanings (i.e'., letters, number's, and syllables) 
ere to be encrypted, it is necessary first to encrypt the designator 
"Spell/fig Begins", followed by the cipher equivalents of the particular 
upper-case meanings; when the spelling is completed, the designator "Spell/ 
fig Ends " is encrypted, to show the return to lower-case meanings. The 
coordinates of the chart, as used by AGGRESSOR, were random sequences and 
were changed dally, tbe inside of the chart remained unchanged 

e For the most part, the stepB used in the recovery of plain texi 
from messages involving syllabary squares differ from those used in the 
solution of previously-discussed multiliteral and polygraphic systems only 
in that a larger number of plaintext elements may have to be considered. 

The cryptanalyst must accordingly modify his interpretation of the fre¬ 
quency characteristics and idiomorphic patterns occurring in such messages. 
By a careful study of the behavior of frequently recurring cipher units, 
the analyst is led to conclude that certain units, because of the general 
characteristics they exhibit, must be representative of numbers, others of 
punctuation, others of single letters, and so on. This classification is 
based upon a knowledge of the general behavior of the various classes of 
plaintext elements For example, expher units representing digits may be 
expected to appear in clusters (as in dates and time, and the designations 
of topographical features, such as hills, road junctions, etc.); whereas 
those which represent punctuation may be expected to appear at varying 
intervals throughout the message text (the particular intervals being de¬ 
pendent upon the particular punctuation mark) When this classification 
has proceeded upon a solid foundation far enough, each set of cipher units 
is underlined throughout the text in some distinctive manner by means of 
colored pencils. Subsequent to this, the individual members of each class 
of cipher units are subjected to closer scrutiny, and based upon a know¬ 
ledge of tbe specific behavior of the various elements in each class, 
specific units are identified as having specific plaintext meanings. For 
example, among those cipher units which the analyst has decided constitute 
tne class which represents plaintext digits, the particular cipher unit 


2 

The term low-echelon as applied to a cryptographic system means that 
the system is designed for use at the lower organizational levels such as 
(in the army) at the regimental level and below. The term low-grade as 
applied to cryptosystems means that the inherent security afforded by the 
system is low. Cf < the terms medium-echelon and -grade , and high-echelon 
and -grade. 
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representing plaintext "0" nay "be expected to be readily recognizable on 
the basis that (l) it is one of the tinee units which appear as the first 
unit in those clusters which are suspected of repieoenting four-digit 
time designations onfl (2) it is one of the two cipher units which, with 
any noteworthy frequency, occur doubled at the end of the some four-unit 
clusters. 

= .* "* 1 <■ < - w - -v * 

f When working on messages involving code chon to, the cryptanalyst 
usually starts by attempting to isolate sequences of cipher units which 
represent plaintext letters, syllables, numbers and punctuation Subse¬ 
quent to this he proceeds to classify and identify those particular cipher 
units in the manner described in the foregoing subparagraph; the recovery 
of word meanings is usually accomplished much later. The isolating of the 
cipheitext units which represent syllabary portions may be readily accom¬ 
plished in those cases wherein the underlying code chert has only one 
"Spell/fig. Begins" group and one "Spell/fig Ends" group, since the 
recognition of those designators automatically permits one to divide the 
ciphci text into word values and non-word values, the recognition of those 
designators is made on the basis of their high frequency and thoir alter¬ 
nating placements throughout the cipher text 

g. Ac plaintext meanings are recovered in a syllabary square system 
or code chart system, those meanings should be entered into a skeleton 
matrix in a manner similar to that used in the solution of the bipartite 
systems previously described (Sections VII and VIII). This is done in 
order to uncover and exploit as early as possible any evidences of syste¬ 
matic construction arising from the anangement which was used in tho under¬ 
lying matrix. It may be assumed that each syllabaiy square and code chart 
will normally have had Its internal elements arranged in some type of sys¬ 
tematic fashion in order to permit the ready finding of plaintext elements 
during the encryption of a message 

h. When there are special circumstances involved, for instance, when 
the contents or the exact internal construction of the matiix is knotm, or 
when the arrangement of the outside coordinates is known, or vh.cn messages 
with isologous syllabary portions (i.e., spellcd-out portions encrypted 
"off-the-cut", such as IN TER CE P TO R and I KT ER OE P T OR) arc present 
in the ciphor text, solution is naturally considerably facilitated. Even 
when only a single message Is available, if the matrxx is known there may 
be special appioacheo to solution, based on the nature of tho plaintext 
elements constituting each row and each column of the particular matrix. 

Eor instance, if the words REFERENCE and YOUR and ISSSASE are known to he 
in the same row of a particular code chart, then it would be quite possible 
that the cipheitext sequence LA LH LT at the beginning of a message repre¬ 
sents the stereotype REFERENCE YOUR j. LESS AGE, if but a few' other similarly 
identifiable sequences weie also available to the cryptanalyst, he could 
possibly recover uhe anangement of the outside cooidinates after a relative]/ 
few steps. „ t 

O l Cryptosystems employing characters other than letteis or figuaes _— 

a In practical cryptography today, the use of chaiacueis other than 
the letters of bona fide alphabets (including recognized Horse and Eaudot 

2.52 
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alphabets) or the 10 digits is comparatively rare. Whan so-called symbol 
ciphers, that is, ciphers employing peculiar symbols, signs of punctuation, 
diacritical marks, figures of "dancing men", and so on sac encountered in 
practical work nowadays, they are almost certain to he simple manoalphabetlc 
ciphers. They are adequately described in romantic talos,3 in popular books 
on cryptography, and in the more conunon types of magazine articles. No 
further space need he given ciphers of this type in this text, not only be¬ 
cause of their simplicity hut also because they are encountered in military 
cryptography only in sporadic Instances, principally in censorship activitiesl 
Even in the latter cases, it is usually found that such ciphers are employed 
in "Intimate" correspondence for the exchange of sentiments that appear 
less decorous when set forth in plain language. They are very seldom used 
by authentic enemy agents When such a cipher is encountered nowadays, it 
may practically always he regarded as the work of the veriest tyro, when it 
is not that of a crank or a mentally-deranged person. 

b. The usual preliminary procedure in handling such cases, where the 
symbols may be soraewnat confusing to the mind because of their unfeunlliax 
appearance to the eye, is to substitute letters for them consistently 
throughout the message and then treat the resulting text in the manner in 
which an ordinary cryptogram composed of letters is treated. This procedure 
also facilitates the construction of the necessary frequency distributions, 
which would bo tedious to construct by using symbols. 

o, A final word must be said_on the subject of symbol ciphers by way 
of caution When symbols are used to replace letters, syllables, and entire 
words, then the systems approach code raethor's in principle, and can become 
difficult of solution *■ The logical extension of the use of symbols in such 
a form of writing xs the employment of axbitraiy characters for a specially 
developed "shoi’thand" system bearing little ox no resemblance to well-known 
and therefore nonsocret, systems of shorthand, such as GregG, Pitman, etc. 
Unless a considexable amount of tgxt is available fox analysis, a privately- 
devised shox’thand may be very difficult to solve. Foi’tunatoly, such systems 
are rarely encountered in military cryptography. They fall under the heading 
_______ ■ a" * *r- 

3 The most famous: Edgar Allan - Poe's, The^Gold Bug ; Sir Arthur Conan 
Doyle’s The Adventux’e of the Dancing Men , Jules Verne’s A Journey to the 
Center of the Earth . - 

* ^ The use of oyfibols for abbreviation and speed in writing goes back to 

the days of antiquity. Cicero’s freedman and amanuensis, Tiro,is repoited 
to have drawn up "a book like a dictionary, in which he placed before each 
word the notation (symbol) which should zeprosent it, and so great was the 
number of notations and words that whatever could be written in Latin 
could be expressed in his notation." The designation "Tironian notes" is 
applied to this type of shorthand. 
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of cryptographic curiosities, of interest to the cryptanalyst in his 
leisure moments .5 

82 Special remonks concerning the initial classification of crypto ¬ 
grams .—n ’I ! he student should by this time have a good conception of the 

basic nature of monoalphabetic substitution and of the many variations 
■which may be played upon this simple tune The first step of all, natu¬ 
rally, is to bo able to classify a cryptogram properly and place it in 
either the transposition or the substitution class. The tests for this 
classification have been given and as a rule the student will encounter 
no difficulty in this respect. 

b. There are, however, certain kinds of cryptograms whose class 
cannot be determined in the usual manner, as outlined in par. 25 of this 
text. First of all there is the type of code message which employs bona 
fide dictionary words as code groups. Naturally, a frequency distribution 
of such a message will approximate that for normal plain text. The ap¬ 
pearance of the message, hovevei, gives clear Indications of what is in¬ 
volved The study of such cases will be taken up in its proper place. 

At the moment It is only necessary to point out that these are cod e 
messages and not cipher , and it is for this reason that in pars, 2F and 
25 the words "cipher’ <r and "cipher messages" are used, the word "crypto¬ 
gram" being used only where technically correct. 

£. Secondly, there come the unusual and borderline cases, including 
cryptograms whose nature and type can not be ascertained from frequency 
distributions. Here, the cryptograms arc technically not ciphers but 
special forms of disguised secret writings which are rarely susceptible 
of being classed a3 transposition or substitution. These Include a large 
share of the cases wherein the cryptographic massages are disguised and 
carried under an external, innocuous text which is innocent and seemingly 
without cryptogiaphic content—for Instance, in a massage wneroin spe¬ 
cific letters arc indicated in a way not open to suspicion under censor¬ 
ship, these letters being intended to constitute the letters of the 
cryptographic messages and the other letters constituting "dummies 
Obviously, no amount of frequency tabulations will avail a competent, 
expert cryptanalyst in demonstrating or disclosing the presence of a 
cxyptographic message, written and secreted within the "open" message, 
which serves but as an envelop and disguise for its authentic or real 
Import. Certainly, such frequency tabulations can disclose the e cistence 
neither of substitution nor transposition in these cases, since both fopas 
are absent. The next paragraph contains more about these latter cases 


5 

An example is found in the famous Pepys Diary, which was written m 
shorthand, purely for his own eyes by Samuel Pepys (1633-1703) "He wrote 
it in Shelton’s system of tachygraphy (l64l), which he complicated by 
using foreign languages or by varieties of his own'invention whenever he 
had to record passages least fit to be seen by his servants, or by ’all 
the world.'" 

^ The subparagraph which the student has just read (82c) contains a 
hidden cryptographic message. With the hints given in par. 83 let the 
student see if he can uncover it 
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63. Disguised secret coivtou nig at Ions —’a As was mentioned"* above, 

there i sa general class pi methods of secret wrll log in which a secret 
mess ago 10 concealed within the text of an apparently innocuous plain¬ 
text message, also, hy extension, a secret message may he concealed within 
otherwise hona fide media such as maps, drawings, charts, music manu¬ 
scripts, bridge hands, chess.problems, shopping lists, stock quotations, 
and so on The addressee of such a communication, knowing where to look 
for the secret elements, does so and from them is able to read the message 
contained within its covering disguise. When the plaintext elements of 
the secret message are concealed by surrounding them with the plaintext 
elements of an Innocent cover text, such a system 13 known as a conceal ¬ 
ment system When, however, the actual plaintext elements of tho secret 
message arc not themselves concealed within a cover text, but instead 
have code equivalents which are themselves actual plaintext words or 
phrases and which are used to form an apparently innocent message, such 
systems are called open code systems . 

b. An example of o concealment system message is the comsnunication 
"HAVE™ESTABLISHED LOW PRIORITY", in which the secret message "help" has 
been concealed as the first letter of each word of •Che covering text As 
an example of an open code, in the message ri AUNT MARY LEFT FOR DETROIT ON 
FRIDAY", the words AUNT MARY might stand for "five troop ships", DETROIT 
might mean "Southampton", and FRIDAY might stand for "Monday." An often- 
cited case of open code ip ttye message "A SON IS BORN", which allegedly 
was sent out by German-controlled radio stations all over the world in 
August, 191^, meaning that war was about to be declared 

L 

c. The solution of concealment systems may pose considerable diffi¬ 
culties for the cryptanalyst, who is placed in the rather odd situation 
where he might have before him a simple system, if he can but find thd 
system. Most of the statistical and other tools at the disposal of the 
cryptanalyst are of no avail to him in the attack on concealment systems. 
First of all, he might not even know whether or not a given letter docs 
contain a secret message, often the only reason for an examination of a 
particular message, other than a random sampling case, is that the origi¬ 
nator or the addressee is on a suspect list and therefore the communication 
is considered for possible secret writing. The difficulty^in analysis 

is usually "not brought about by the complexity of the system, for conceal¬ 
ment systems are almost always cryptographically simple. The ditficulty 
of the problem arises from the lack, at tho outset, of tangible crypto¬ 
graphic elements into which the cryptanalyst can "get his teeth" There 
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is ptimnrlly udc question of whether or not a secret text actually exists,? 
and, if it does, where ore the elements constituting the secret text As 
a consequence, locating the elements of the secret text and deriving the 
meaning of the secret text arc practically synonymous. Success in this 
type of analytic work requires extraordinary patience and perseverance, 
keen powers of observation nurtured by unrelenting suspicion, a lively 
imagination, exceptional ingenuity, and organised methods of analysis*— 
plus a firm foundation and considerable experience in the methods and “ 
practices of concealment systems 

d The number of different concealment systems possible Is enormous. „ 
The letters of the secret message might he concealed as th^ first, second, 
or third letters of the cover text, or they might he concealed as the*final 
penultimate, or antepenultimate letters of the words, or they might to 
concealed by means of a specific key into prearranged variable placements 
within the words of the innocent text. The secret text might he read by 
coxi3idei ing the letters which follow or precede all unnecessary breaks in 
cursive handwriting; or the secret text might he indicated by shaded 
letters or by pin pricks over significant letters, or even by elongated 
tails on words pointing to significant letters In the line above. In the 
analysis of such concealed-letter systems, it is advisable to write the 
successive words of the cover text one below the other. In a column, aligned 
by their beginnings and subsequently to rewrite them columnwise aligned by 
their endings, this will assist in disclosing a secret text hidden in a 
fixed position relative to the beginnings or endings, or in diagonal routes 
near those locations (see Fig 30l). It is also advisable to write out the 

- — — I T 


7 

In this connection, it is worthwhile to cite an extract from an official 
report prepared in 19^-6 by the wartime Office of Censorship; 

"Detection of concealed messages is based on the principle that 
there is no absolutely safe disguise for duplicity. Espionage letters 
have weaknesses and identifying characteristics, which modern techniques 
can minimize hut never completely eliminate. Seasoned examiners develop 
an ability to relate facts and think clearly about possibilities They 
develop a keen perception of, or alertness to, certain peculiarities, 
an attitude of suspicion toward certain indicators, and experience or 
training in handling certain types of materials._ _ , 

_ j ,v. . - - > fi i i 

"The texts of letters containing concealed messages do not ring 
true; they lack spontaneity, and the normal emphasis which people give 
to certain thoughts or ideas is absent Something comparable in social 
life is the stilted behavior and speech of a person who is obliged to 
entertain a stranger with whom he feels nothing in common, he behaves 
unnaturally, he desires to he polite, hut in order to do so he must 2 
hide his boredom and pretend an interest he does not feel. Exactly the 
same is true in the writing of cover texts or open code letters—the 
attempt to pursue two aims simultaneously results in strain. Skill 
and experience may overcome the strained-text hazard to a high degree, 
but they can never completely dispel the distortion and dislocation of 
a normal emphasis inevitable in a cover letter." 
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cover text in rectangu]ar arrangements of varjous widths, in order to dis¬ 
close secret text which might have been concealed in every nth letter of 
the entire cover text (see Fig, 102). In cases whore physical indicators 
are employed, such os breaks in handwriting or as shaded ] otters, an exami¬ 
nation of the letters in the immediate vicinity of such indicators iron]d 
disclose the secret text 

Cover text: * * ' Cover text: 

-- t -pp--■- 

UNCLE EZRA SEEMS DESPONDENT. * NHElfyOU SEE CHESTER AT 

HAVE YOU HEARD THE LAST REPORT? - MADISON'S HOUSE TELL HIM 

LOIS DEPARTED. 

W F E N Y 0 
USEEC H 
ESTERA 
T M A D I S 
0 N S H 0 U 
S E T E L L 
HIMLOI 
SDEPAR 
TED 

REPORT 


UNCLE 

EZRA 

SEEMS 

DESPONDENT 

HAVE 

“YOU 

HEARD 

THE 

LAST 


Secret text: NEED HELP 


Secret text: NEED HELP 


Figure 101. Figure 102. 

e. Some systems involve the concealment of entire words, instead of 
just individual letters, in the cover text Thus, for example, the secret 
text might consist of (l) every nth word of the cover text, (2) the first 
and last words of every line, (37 words preceding or following punctuation 
marks, (4) words bisected by an imaginary line running diagonally from the 
upper left to the lower right of the sheet of paper; or countless varieties 
of similar schemes. Grilles have also been used, the secret text being 
witten through the apertures of the grille on placed positions on the 
sheet of paper, and then a covering letter written to surround and cam¬ 
ouflage the secret text. In the solution of concealed-word systems, ex¬ 
amining the text produced by counting off every nth word may bear fruit; 
if the secret text is long enough, the validity of the assumed secret text 
may be proved by the consistency of the decimation. In cases wherein a 
variable key has been used to indicate which words constitute the secret 
text, proof of the assumed secret text may be impossible, unless the key 
is short compared to the message lengths, or unless additional messages in 
exactly the same key are available for comparison to test an assumed key. 


f. There have been many cases in which a secret text has first been 
converted into the dots, dashes, and spaces of the Morse code, or encrypted 
in a Baconian or a tripartite cipher; then this converted text was concealed 
within an innocent text in any one of the almost infinite number of possible 
ways Some of these ways in which the nruitiliteral elements of the pre¬ 
liminary conversion may be represented are by (l) the lengths of words. 


CONFIDENTIAL 



2 


" -* 


* 


* \ t 


t 


r » 



CONFIDENTIAL 


ID: A56892 


(2) the number of vowels or consonants in the words; (3) the number of 
ayllahles In the words; or (4) by the ways in which t's are crossed or JL's 
are dotted. The solution of such systems involves experimentation with” 
basic hypotheses concerning the manner in which raultllltcral elements are 
denoted, followed hy a recombination into monpalpbabctic terms (under the 
assumption of a Morse, tripartite, or Boconian system) and solving the 
1educed monoalphabctlc text Another method for a concealment system 
involves the use of a bipartite matrix employing coordinates consisting 
of vowels (or, for that matter, any othei set of five or six letters), 
the cecier text is first enciphered in this biliteral system, and then 
tnc vowels arc surrounded by consonants to form the plain text of an 
innocent cover message. As in most concealment systems, once such a 
suot etfuge is suspected or assumed , then and only ihen is solution 
po aside 

g The detailed discussion thus far has been limited to concealment 
systems ® In cases of open code, unfortunately there are no clear-cut 
juethodo of analysis or even of recognition, there is simply no rational 
way of proving that a message such as "AUIIT MARY LEFT FOR DETROIT TODAY” 
contains a secret meaning, unless it is known for a fact that the sender 
nas no aunt named Mary, and even then there still might exist a friend 
of the sender's who is affectionately called "Aunt Mary”—or, for that 
matter, she might be someone else *s aunt 9 And once having suspected 
or even proved that there is somithlng rotten in Denmark, proof of the 
content of the hidden meaning is simply out of the quesiion unless the 
serder is somehow convinced to mend his ways and thereupon volunteers 
the information In many wartime instances where open codes have been 
used, a legal case could not be proved against a suspect without his 
cooperation. 

h. A prominent case of the U3e of open code in espionage communi¬ 
cations is that of an Axis spy, Mrs. Velvalce Dickinson, who in August, 
1944, was sentenced in New York to ten years' imprisonment and was fined 
*10,CX after pleading guilty to the charge that a series of letters she 
had written to an agent in Buenos Aires in the early part of 1942 con¬ 
tained secret messages hidden in the plain text. These messages gave 
information regarding the location and condition of allied warships in 
Pacific ports These two agents professed to be dealers in antique dolls 
and used a prearranged code giving secondary meaning to words pertaining 
to the sale of dolls Mrs. Dickinson would send out letters advertising 
01 offering to sell certain of her antique dolls to the addressee. She 
would write the doll’s name and after the, name a brief description, then 
she would write, as in an ordinary business letter, the price of each 

.—q-— 

0 Further discussion of this subject will be found in Appendix 9, 
"Concealment Systems," 

9 In one instance, it has been related that a censor reviewed a tele¬ 
gram transmitted by a person on a suspect list The telegram read "FATHER 
IS DECEASED.” The censor, smelling a rat, changed the text to read "FATHER 
IS DEAD", and waited. Sure enough, several hours later came a query: “IS 
FATHER DEAD OR DECEASED?" ^ 
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doll The originoD cause for'suspicion was the eytrouv* variation in 
pi’ices ovei a range of three or four letters of what was apparently the 
same doll 01 the same type of doll A great runny lctfceis irei" necessary 
in order" to hulld up a cape sufficient to prove the uae of open code. 

It is doubtful even then that the use of open code could have been legally 
provon except_for’the fact thht / faced with so much evidence against her, 
she choso to confess this use. 1 * 

* I*. * * / '*.*< ' 

t 

J.. In addition to concealment systems and open codes, there axe 
three other methods for hiding the existence of secret text. These 
methods embrace the following:* - 

» < « 

(1) secret inks; 

(2) microscopic writing. Involving use of micropantographsj and 

(3) photographic methods, including "microdots" (i.e., the 
reduction of a page of copy to a negative the size of a miniature 
dot, which is then affixed on a period or on the dot of an "i"), 
double printing, double exposure, or'concealment within photographs. 

The methods of use and analysis of these systems, however, are beyond the 
scope of this text. ’ ~ ~ ‘ 

* j. 

8 k Concluding remarks --a The student will have by this time 
appreciated" that "monoaiphabetic substitution ciphers are for the mogt 
part quite easy to solve, once the underlying principles are thoroughly 
understood. As in other arts, continued practice with many examples 
leads to facility and skill in solution, especially where the student 
concentrates his attention upon traffic all of the same general nature, 
so that the type of text which he is continually encountering becomes 
familiar to him and its peculiarities or characteristics of construc¬ 
tion give clues for short cuts to solution It is true that a know¬ 
ledge of the general phraseology of messages, the kind of words used, 
their sequences, and so on, is of very great assistance in practical 
work in all fields of cryptanalysis In operational cryptanalysis,it 
is of vital importance to gain a knowledge of the language habits of a 
particular group of correspondents, to permit the rapid exploitation of 
the cryptosystem involved. Thus, at least initially, all possible traffic 
is cryptanalyzed, even tnat in simple systems and that of comparatively 
little intelligence value Word lists obtained empirically are of more 
value than "intuitive" or academic compilations, however, at the outset, 
reference may of course be made to these latter compilations 

b. Some of the simpler subterfuges which the student should be on 
the lookout for in monoalphabetic substitution are the following: 

X 

(l) There may be employed in the cryptographic scheme the con¬ 
secutive use of several different mixed cipher alphabets in a single 
long message Obviously, a single, composite frequency distribution for 
the whole message will not show the characteristic crest and trough appear- 


See in bhis connection the word and idiomorph lists comprising Appendix 3* 


C ONFIDENTIAL 


259 



REF ID:A56892 


CO NFIDE NTIA C 

ance of a simple monoalphabet 1c cipher, since a given cipher unit will 
represent different plaintext letters in different parts of the message. 

But if the cryptanalyst will carefully observe the distribution as it is 
being compiled , he will note that at first it presents the characteristic 
crest and trough appearance of monoalphabetieity, and that after a time 
it begins to lose this appearance If possible he should be on the look¬ 
out for some peculiarity of grouping of letters which serves as an indi¬ 
cator for the shift from one cipher alphabet to the next. If he finds such 
an indicator he should begin a second distribution from that point on, and 
proceed until another shift is encountered. __By thus isolating the dif¬ 
ferent portions of the text, and restricting the frequency distributions 
to the separate monoalphabets, the problem may be treated then as an ordi- 
, nary simple monoalphahetic substitution. 11 Consideration of these remarks 
in connection with instances of this kind leads to the comment that it is 
often more advisable for the cryptanalyst to compile his own data, than 
to have the latter prepared by clerks, especially when studying a system 
ab initio . For observations which will certainly escape an untrained 
clerk can be most useful and may indeed facilitate solution. For example, 
in the case under consideration, if a clerk should merely hand the com¬ 
pleted over-all uniliteral distribution to the cryptanalyst, the latter 
may be led astray, the appearance of the composite distribution might con¬ 
vince him that the cryptogram is much more complicated than it really is. 
While still on the subject of frequency disti ibutions, it is pointed out 
that, although earlier (par 43) the triliteral frequency distribution 
was cited primarily for its usefulness in extracting frequency data relative 
to the digraphs and trigraphs occurring in a simple substitution cipher, 
this particular type of distribution is used extensively in the manual 
attack on many other types of cryptograms because it provides one of the 
best means for systematically locating all of the repetitions which appear 
in a message. 

(2) There have been cases where direct and reversed standard al¬ 
phabets have been used alternately in a single cryptogram, the change of 
alphabets being made at Irregular intervals, or changed at the end of every 
word or with each group of five letters. If the interruption takes place 
at too short an interval, not only will a frequency distribution be of no 
avail, but also it would bo almost impossible for the cryptanalyst to de~ 
teimine when and how tho change of alphabets occurs from a mere examination 
of the cipher text However, if the cryptanalyst is on the alert to try tbe 
simplest tiling first f completing the plain-component sequence on the as- 
sumption of standard alphabets will yield a solution ’fchere otherwise a 
solution might he out of the question 

" £ 

(3) Another subterfuge that has been encountered is the encryption 
by means of a ironoalphabetic uniliteral substitution of a message whose 


11 The cryptanalyst should be on the alert for the possibility of re]ated 
alphabets in such a system, if tills is tbs case, the reconstruction oi th i> 
pillory co-poneuts from vhe solution of one - portion of the i y ould 

enable the reading of oilier per lions of the ireosrge by r nu o£ ■'b gei 'ro¬ 
ll lx l> ■'tl od til. ted ill jn 50 
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plain beet hae first becn'vrittcn T^^^lcwards (or for that matter, on ordinary 
simple substitution cipher sent bcck^ards) Ciphers of this typo my suc¬ 
cessfully resist the unsystematic attempts of solution which a tyro might 
make, however, the experienced analyst would probably quickly recognize the 
vcaic subterfuge if he wore to examine the frequencies of cipher digraphs, 
trigiaphs, and tetragraphs, in relation to the uniliteral frequencies of 
their component letters , . , 

c. KonoaDphabetic substitution with variants represents on extension 
of the basic principle, with the intention of masking the characteristic 
frequencies resulting fromstrict monoalphabeticlty, by means of x.hlch 
solutions oxq rather reodily obtained Some of the subterfuges applied on 
the establishment of variant^or multiple values aro simple and more or loss 
faxl to serve the purpose for which they are intended? others, on the con- 
tiaiy, may interpose serious difficulties to a straightforward solution. 

But in no cose may the problem be considered of more than ordinary dlffi- 
- " V'-ivwc it should be recognised that where the so subterfuges 

-re -« ij j j adequate to the purpose, the complications introduced are owh 
tnat the practical manipulation of the system becomes os difficult for tho 
cryptographer as for tne cryptanalyst 

(1) A few words my be added hero Jn regard to a method which 
c*fhen suggests itself_to laymen, but which is very old indeed in tho art. 
ibis errslets in using a book possessed by all the correspondent** and 
Oidicuting the letters of the message by means of numbers referring to 
■^cc3 + JLc lelteis in the book One way consists in selecting a certain 

pege and then giving the line nuubci and position of the letter in too 
3in-', tho page number being shown by a single initial Indicator Another 
wiy is to -the entire book;, giving the cipher equivalents in groups 
of three numbers repiescuLing page, line, ond number of letter (for example, 
75-8-10 means page r \ 5, 8th line, 10th letter in the Dine) Such systems 
are, however, extremely cumbersoi^ to use and, when tho cnciphoi-ing is 
done carelessly, can "be solved The basis for solution in such coses rests 
upon the use of adjacent letters on the same Dine, the accidental repe¬ 
titions of certain letters, end the occurrence oi unenciphorcd woids in 
the messages, when laziness or fatigue intervenes in the enciphering 

(2) It wry also oe indicated that human nature and the falli¬ 
bility of cipher clerics is such that it is rather rare for an encipherer 

b — 

In 1915 che German Government conspired with a group of Hindu revo- 
lutiorm10s to stir up a rebellion in India, the purpose being to cause 
tho withdrawal of British troops from the Western Front Hindu concpira- 
bors In tne United States'were given xronoy to purchase arms and emauuni- 
tjon end to transport them to India For communicatJon with their sup- 
e< Dora in Berlin tho compiratoi3 used, among others the eye ten described 
in tnxs paragraph A 7 _ P°S e typewritten lettei buiD b up Iron rage, line, 
and Dctter-numbei references to a book knoim only 10 the ccnwiuuicants, 
vis intercepted by the Briiish and'turned over 10 tne United Spates Gov- 
cj nm-*nt foi use in connection with tho prosecution of the Hindae for 
violating our pcntiald-jpy ’, J?he author Jp.T,¥jJ solved this message with¬ 
out the book in question, by taking full n&vante&e of the clues ref«red 
to ‘ ' 
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to make full use of the complement of variants^placed at his disposal. 

The result is that in most cases certain of the equivalents will he used 
so much more often than others that diversities in frequencies will soon 
manifest themselves, affording important data for attack hy the crypt¬ 
analyst ' t 

d There is one. aspect of cryptography within the realm of mono- 
alphabetic substitution ciphers that should he discussed at this point— 
the aspect involving repetitive roonoalphahetic substitution. 

(1) Suppose a message undergoes a primary encipherment hy means 
of a single mixed, non-reciprocal cipher alphabet, and this piimary ci¬ 
pher text then undergoes a secondary encipherment hy means of the same or 
a different mixed alphabet The resulting cryptogram is still monoalpha- 
betlc in character, and presents very little, if any, augmentation in the 
degree of security (depending upon the type of alphabet employed) .-*-3 Here 
an entirely illusory increase in security is involved and on ineffectual 
complexity is introduced, the process may indeed be repeated indefinitely 
without producing the desirable result of added security Similarly, the 
some illusory increase in security is present in the case of repetitive 
multiliteral encipherments involving regular-length ciphertext units, as 
long as the repetitive encipherments aie made "on the cut" . 

(2) In the case of repetitive polygraphic encipherment made on 
the cut, a moderate increase in security is achieved over the degree of 
security normally provided hy a single polygraphic encipherment. For 
instance, in the case of repetitive digraphic encipherment using,let us 
say, a four-square system for the first encipherment and modified Play¬ 
fair system for the second step, the final encipherment is still monoalpha- 
betlc digraphic in character, except that the* cryptosystem might have to 
he resolved as involving a more-or-less random square table, instead of 
being recovered in its primary and secondary steps, all the repetitive 
encipherment has accomplished is that it has added to the difficulty of 
reconstruction of the matrices used—hut this, in the ease of a digraphic 
system, is a reasonably fair increase in security, since we expect solution 
to be expedited through an early recovery of the matrix 

(3) When, however, successive multiliteral or polygraphic en¬ 
cipherments are made "off the cut" for the second step, the increase in 
security can he considerable, since the end result no longer exhibits the 
phenomena of monoalphabeticity and the eryptanalytic complexity of the 


13 *■ - - - 

The only possible slight increase in security lies in the fact that 

the key wox’ds for the primary and secondary encipherments might he made 

more difficult to recover or even impossible to recover 
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system has been thereby materially enhanced ‘*" + For example, using the 
two-oquare matrix illustrated in Fig 55 on page l 62 , the massage 
BEEHE* OECEi IEIITS EEEDFD undergoes the, following enciphermants* 

RE FN FO EC Eld Eli TS HE ED ED 

Cipher I IL DP UM CF KT DP GI UL DF DF 

\/V W V V V v\/ 

Cnpher II OC OT MC^ MR TD QF TO GC AH 

The first encipherment, IL DP UM. , is subjected to a second encipher¬ 
ment by considering tho digraphs "off the cut", resulting in the encrypt- 
msnt OC OT IE In the final cryptogram, the first and last letters of 

the primary encipherment may be retained os is, or they may be combined 
for the second encryption, for added security, thus the final cryptogram 
may read either IOCOT OCABF, or ROCOT . OCAHG When this sort of 
secondary encipherment is applied in a repetitive multillteral cipher, tho 
system is called a fractionating system 15 The cryptanalysis of these 
systems, which is often quite complex, will be treated in subsequent texts. 

e. If the cryptanalyst is fortunate enough to have a pair of icologs, 
one message of vnieh is in a monoslphabetic substitution system and the 
other in a transposition system, it may be possible for him to make exact 
identifications of the elements in the substitution cipher based on the 
plaintext letter frequencies present m the transposition cipher. Then, 
having the plain text, tne solution of the transposition is greatly facil¬ 
itated 


*i 2i 

A A rather ingenious idea proposed by Charles Eyraud in his excellent 
work, Piecis de Cryptogrophie lloderne , Paris, 1953 , PP 224-225, involves 
a repetitive cnclpneiment using two different monome-diuoire matrices. In 
Eyraud’s example, using tho two matrices illustrated, the plain text 


123456789 


S A IT T I R 
UDLFVQMPC 
HGOBXWJZK 


Matrix I 



w 

Matrix II 


E CRIT UR ES SE C R E T E S 

3 19 9 8 7 11 9 34 4 3 19 9 3"7 3"4 

31 9 9 8 7 ll 9 34 k 31 9 9 3T 34 

IAAZUCAQBIAAFQ 

•4 

’’ECRITURES SECRETES” is first enciphered with Matrix I, then the digits are 
recombined into letters using I’atrxx il with the resulting expner text 
IAAZU (It is interesting to note that the 17 letters of the plain text 
are encrypted by only l4 letteis in the final cipher*) The letter Yp is 
eliminated from Matrix I, and is included in .Matrix II to take care of a 
final 1 or 3 in tne fiist encipherment which otherwise could not have been 
encrypted as a single element. 

-*•5 See Appendix 7 for other examples 
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f As has already been stated in subpar 2c, mathematics and math¬ 
ematical methods have an important place in the art of cryptanalysis. This 
text has included only those Introductory statistical and mathematical 
applications which apply to monoalphabetic systems If it appears to the 
student that there has been a rather extensive treatment of too-specialized 
techniques, let him be reassured that these have been included as being in 
the nature of collateral information, rather than being an absolute''necessi¬ 
ty in the solution of the particular problems to which they were applied 
As a final wold of caution to the student the following extract from a re- 
poib by C . H. O’D. Alexander is Included: 

"There is a considerable danger that a learner, when he realizes that 
statistical methods can be of some use, will attempt to use them where they 
are quite inappropriate If he does this a few times and finds It gets him 
nowhere, he then gives the whole thing up as a waste of time and does not use 
such methods where he might There is also the worse danger of doing statis¬ 
tical test3 for their own sake so that they are used as a method of passing 
the time and avoiding real thought about the problem to be solved " 

g. The general problem of crypt analytic diagnosis has been discussed 
briefly in various Sections of this text The problem is far from simple, 
since many variations and conventions may be encountered in the various 
systems treated in this text, furthermore, the problem is made even harder 
by the fact that certain systems, themselves quite simple, may be combined 
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to produce a system much more difficult to diacnooe The lack of precise 
diagnostic tests, such as those available in the natural sciences,1*5 "is 
brought about by the fact that variations and conventions introduced into 
otherwise conventional systems may change radicolly the appearance and 
manifestations expected in the cipher text produced by the known systems, 
yielding "hitherto-unencountered phenomena." Each cryptosystem is then 
actually an individual and unique case in diagnosis IT 

(l) For example, four-letter cipher groups of the pattern con¬ 
sonant-vowel-consonant-consonant jLo not necessarily prove a code system, 
even though this grouping is a frequent one in four-letter code systems, 
the basic system might still be a cipher system, with the apparent 

Theauthor feels that it is of value to pursue further a discussion 
of how the science of cryptanalytics compares with some branch of one of 
the natural sciences, when the diagnostic procedures involved in each are 
considered. In that branch of biology called taxonomic botany, for example, 
the first steps in the classificatory process are based upon observation of 
externally quite marked diffeiences; as the process continues, the observa¬ 
tional details become finer and finer, involving more and more difficulties 
as the work progresses Towards the end of the work the botanical tax- 
onimist may have to dissect the specimen and study internal characteristics. 
The whole process is largely a matter of painstaking, accurate observation 
of data and drawing proper conclusions therefrom Except for the fact that 
the botanical taxonomist depends almost entirely upon ocular observation of 
characteristics while the cryptanalyst in addition to observation must use 
some statistics, the steps taken by the former are quite similar to those 
taken by the latter It Is only at the very end of the work that a sig¬ 
nificant dissimilarity between the two sciences arises. If the botanist 
makes a mistake in observation or deduction, he merely fails to identify 
the specimen correctly, he has an "answer"—but the answer is wrong He 
may not be cognizant of the error, however, other more skillful botanists 
will find him out But if the cryptanalyst makes a mistake In observation 
or deduction, he fails to get any "answer" at all, he needs nobody to tell 
him he has failed. Further, there is one additional important point of 
difference The botanist is studying a bit of Hature—and she does not 
consciously interpose obstacles, pitfalls, and dissimulations in the path 
of those trying to solve her mysteries. The cryptanalyst, on the other 
hand, Is studying a piece of writing prepared with the express purpose of 
preventing Its being read by "any“persons for whom it is not intended. The 
obstacles, pitfalls, and dissimulations are here consciously interposed by 
the one who encrypted the message. These, of course, are what make crypt¬ 
analytics different and difficult. 

^ Baudouin ( op cit ., ''Chapter XIV) drew up a sort of check list of the 
class if icatory proeedures r which an analyst might follow when attempting to 
diagnose the cryptosystem^'underlying*a"particular cryptogram or cryptograms . 
However, the science of cryptanalytics being as it is does not lend itself 
to successful completion of such diagnostic "check lists." Thus, the one 
compiled by Baudouin is far from satisfactory and is of no more than aca¬ 
demic interest to the present-day practicing cryptanalyst 
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characteristics of a code system. Upon closer examination, it might he 
possible to disprove a code system, based on the non-appearance of certain 
ofcher characteristics that should he present in a code system 

(2) If a cryptogram or a set of cryptograms contain only the 
letters A through 0 in the cipher text, all that can he said Initially 
is that only 15 letters are present in the encrypted text, and that 
the system must he one of substitution, either cipher or code. If a 
cipher, then the system must of course he a multiliteral system (including 
perhaps a mixed-length system), not excluding, for example, a digraphic 
system or a cor?'’ chart For instance, in the biliteral matrix below, 
the ciphoitext units consist only of pairs of consonants, and the plain¬ 
text elements include the 26 letteis and the 37 1 * most frequent digraphs; 
thus the system is essentially a digraphic system Such a system would 
not be at once recognized as a digraphic system, and if the vowels were 
used as nulls, the diagnosis of the cryptosystem would be considerably 
impeded 

h. The often extensive and elaborate treatment of the many varieties 
of cryptosystems within the scope of this text has not been given solely 
for the sake of the analysis of the particular systems involved, but rather 
to illustrate the general cryptanalytic techniques which are applied to 
various problems. In being guided along the lines of "thinking cryptanaly- 
tically", the student has been put in a position to analyze successfully 
many possible variations and modifications of the cryptosystems treated in 
this text and in the accompanying course. The cryptosystems in this text and 
accompanying course have been solved for the most part from one or two 
messages Naturally, there is a certain amount of artificiality in the 
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examples and messages e ployed herein The text? of messages have been 
manipulated, especially in connection with the accompanying problems, 
ill order to illustrate^pedagogical principles and the application of 
cryplanalytic techniques In actual practice, instead of the one or two 
messages, five might be required, or for that matter, fifty or more might 
be lequircd in'oider to effect a solution In operational practice, there 
is frequently a" high incidence of garbles which would have a pronounced 
impact on 56% only~a"facile identification of the crypiosystem but also 
on its subsequent solution Speed is an essential criterion in operational 
practice, a cryptosystem must be broken and massages reod as soon as 
possible, to bo of maximum use to a field commander—messages read six 
or twelve months after they were sent are hardly of more than historical 
importance Nevertheless, when a system is cryptanalyzed for the first 
time, no matter when it is broken it helps maintain cryptologic continuity 
which is of extreme importance in successful operational practice. 

i The student should now study, if he has not already done so, the 
various appendices to this text Through them, he may gain an insight into 
further aspects of cryptography and topics related to the art of cryptana¬ 
lysis Practice on many different ciphers of the types covered in this 
text will tend to sharpen the wits and give to the student confidence and 
facility in the cryptanalysis of unknown examples It 3s for this reason 
that a course of problems is a necessary adjunct to the study of this 
text; as was previously mentioned, one month's actual practice in solution 
is worth a whole year's mere reading of theoretical principles 

j_. It may be of assistance to indicate, by means of a graphic out¬ 
line, the relationship existing among the various cryptographic systems 
thus far considered The outline will be augmented with each succeeding 
text as the different cryptosystems are encountered, and will constitute 
what has already been alluded to In par 6d and there termed a "synoptic 
chart of cryptography" The synoptic chart for this text (Chart 9) forms 
an insert following this Section. Looking at this chart the student may see 
that, although it is essentially dichotomous in form, at several levels 
there appears a sort of cryptographic tertium quid—some category (or cate¬ 
gories) of cryptosystems which properly belongs at the particular level 
shown, but which does not directly fit into either of the two primary sub¬ 
divisions already appearing at that level However, if the student will 
study the synoptic chart attentively. It will assist him in fixing in 
mind the manner m which the various systems covered thus far are related 
to one another, and this will he of benefit in clearing away some of the 
mental fog or haziness from which he is at first apt to suffer. 


k. There remain five more volumes to this series of basic texts on 
the art of cryptanalysis Military Cryptanalytics, Part II, will treat 
mainly periodic polyalphabetic substitution ciphers, including periodic 
numerical systems. Part III will treat varieties of aperiodic substitution 
systems, including an introduction to elementary cipher devices and crypto¬ 
mechanisms, Part IV will treat transposition and fractionating systems, and 
combined substitution-transposition systems. Part V will treat the recon¬ 
struction of codes, and the solution of enciphered code systems, and Part VI 
will treat the solution of representative machine cipher systems . In addition 
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throughout the five remaining texts there will "be interpolated statistical 
techniques applicable to the systems treated, and information on the appli¬ 
cation of analytical machines in cryptanalytlc problems. The security 
classification of each succeeding text will vary according to the information 
contained therein. It is not intended that the student study all six texts; 
life is too short to become an expert cryptanalyst in all fields of the art. 
Parts I and II embrace most of the necessary fundamentals of cryptanalysis; 
the succeeding four volumes will impart knowledge on more specific categories 
of systems with which the cryptanalyst may be faced. , 
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Chart 7. Synoptic chart of cryptography for Military Cryptanalytica, Part I 
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l._ Thi b glossary Is designed primarily to be used in connection 
,. -With the text* Military Cryptanalysis, Part I . It is limited in scope" 
~to cryptologic terms actually appearing In the text, terms likely to 
* he encountered in other^ cryptologic literature of approximately the 
'scone level as the text, ^and a few other tezms'considered necessary to 
complement or 'to clarify certain definitions» _ 


i * ,2• The terms in, this glossary are arranged in strictly alpha- 

* betical order, disregarding word'spaces and'hyphens. Single words 
and certaip hyphenated words are followed directly by an abbreviation 
of the part of speech^ Run-on entries, indicating a part of speech 
different from that of the"main entry', are'shown simply by means of 
a series o£ dashes followed by the abbreviation of the part of speech, 
and the appropriate 'definition^ "Abbreviations used for parts of 
speech, as well as those used to indicate examples, cross-references, 
etc., are those listed in Webster's Hew International Dictionary, 
Second Edition* 
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GLOSSARY FOR MILITARY CRYFEAHALYSIS, PART I 


accidental repetition . A repetition produced fortuitously, and not by 
•the encipherment of Identical plaintext letters by identical keying 
elements* (Cf* causal repetition *) 

additive* n* A single digit, a series of digits, or a numerical group 

" which, for the purpose of encipherment, is added to a numerical 
code group or to numerical cipher or plain text* 

additive book * A book comprising a group of additive tables* 

additive system * A cryptosystem in which encipherment is accomplished 
through the application of additives* 

additive table * A tabular arrangement of additives* 

addressee* n* The office, headquarters, activity or Individual to vhom 
a message la directed by the originator* 


ADFOYX system * A German high-coinnand cipher system used in World War X* 
Essentially, a billteral substitution system employing a 6x6 square, 
to uhlch a columnar transposition vas subsequently applied* 


applique*unit, teleprinter , 
uion wiin a teleprinter 
printer messages. 


A special cipher attachment used in cqnnec- 
to provide cryptographic treatment for tele- 


artificial word * A group of letters having no real meaning, constructed 

.hy the systematic arrangement of vowels and consonants 89 as to give* * 

the appearance and pronounceahillty of a bona fide word* 

Baconian cipher. A cipher system invented hy Sir Francis Bacon (1561-1626), 


it is basically a monoalphobetic substitution system in which single 
plaintext letters are represented hy five-letter cipher equivalents 
formed by permutations of two letters taken five at a time* 

baud* n. A mark or space impulse in the international (Baudot) teleprinter 
code* 


Baudot code , A five-unit code applied to teleprinter systems by Jean Maurice 
Emile Baudot (1845-1903)• It employs a 32-element alphabet composed 
of permutations of two elements taken five at a time* Also called the 
International teleprinter code * 

billteral* adj. Of or pertaining only to cryptosystems, cipher alphabets, 
and frequency distributions which involve cipher units of two letters 
or characters* See the more inclusive term digrephlo j see also bi¬ 
llteral frequency distribution * 
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biliteral alphabet « A cipher alphabet Involving a cipher component com- 
pose# of two-ebaraeter units. 


— ' r j 1 V 

blllteral frequopcy distribution * ~ A frequency distribution of pairs 
formed by combining successive letters or characters. Thus, a 
- blllteral distribution of ABCDEF would list the following pairs: 
AB, BC, CD, EE, EF. (Cf. dlgraphlc frequency distribution .) 


, * i *- * , * ' v v 

bipartite alphabet . A blllteral alphabet In which the cipher units may 
be divided into two separate parts whose functions are clearly 
' defined, viz., row indicators and column indicators of a matrix. 


bipartite system . A substitution system Involving the use of a bipar¬ 
tite alphabet. 

blank-expectation test . See \arobtia. test . ’ * 

bust message . A message containing an error In encipherment which 
jeopardizes the cryptographic security of the message, and thus 
is potentially valuable to the cryptanalyst. 

V.*”* «■ * m. - 

Caesar*s cipher . An ancient form of simple substitution cipher In which 
each plaintext letter was replaced by the letter three places to 
the right of it la the normal alphabet; attributed to Julius Caesar. 

v — 

call sign . A group of letters or numbers, or a combination of both, 
used as^the Identification for a telecommunication station (or 
stations), when stations are establishing contact with each other. 

causal repetition . A repetition produced by tine encipherment "of identi¬ 
cal plaintext letters by identical keying elements. 

cell , n. An Individual small square oh cross-flection paper, grilles, etc. 

characteristic frequency . See normal frequency . v ~~ 

chi-square" ( ~)t) table . A mathematical table listing tfie probabilities 
of occurrence by chance of a chi-square value higher than that 
observed in a given case; an adjunct to tbe chi-square test . 

i;he rela- * 
same source. 

T?or example, the test can be used to aid in the determination of 
whether a distribution is more likely to be random or not; in this 
usage, the observed distribution Is compared with a theoretical 
distribution representing that which Is expected for random. The 
end result of the test is a value representing the discrepancy 
^.-between the two distributions which have been compared. This 
Value, called a M chl-square value” may be interpreted as it is, 

- 'or It may be Interpreted through the use of a chi-square table . 


chi-square (%) test . A mathematical means for determining 
tive likelihood that two distributions derive from the 
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chi ('Y) ‘test * A test applied to the distributions of the elements of 
two cipher texts eltliar to determine whether the distributions arc 
the result of encipherment by identical cipher alphabets, or to 
determine whether the underlying cipher alphabets ore related. 

Also called the cross-product test . 

clfax , n. Enciphered facsimile. The process of converting a plane Image 
into on unintelligible imago or series of electrical impulses ond of 
reconverting it or them into intelligibility through the use of a 
key.—adJ. Using or pertaining to cifax. 

cipher , n. 1. A cipher system. 2. A cryptogram produced hy means of a 
cipher system.—adj. Pertaining to that which enciphers or is en¬ 
ciphered, 

cipher alphabet . An ordered arrangement of the letters (or other con¬ 
ventional signs, or both) of a written language and of the characters 
which replace them in a cryptographic process of substitution. 

cipher clerk . A clerk who enciphers and deciphers messages. 

cipher component . The sequence of a cipher alphabet containing the symbols 
wixich replace the plain symbols in the process of substitution. 

cipher device . A nonmechanical and nonelectrical apparatus used for 
enciphering and deciphering. 

cipher disk . A cipher device consisting of two or more concentric disks, 

. each bearing on its periphery one component of a cipher alphabet. 

cipher machine . A mechanical or electrical apparatus for enciphering and 
deciphering. 

c^Eherjgusre. An orderly arrangement or collection of sequences set 
forth in a rectangular form, commonly a square (c.g., a Vigenere 
square) 

cipher system . Any cryptosystem in which cryptographic treatment is 

applied to textual units of regular length , usually monographic or 
digraphlc. (Cf. code system .) 

cipher text . The text of a cryptogram which has been produced hy means of 

* a cipher system. 

ciphony , n. Enciphered telephony. The process of converting vocal commun¬ 
ications into unintelligibility and of reconverting them into intel¬ 
ligibility through cryptographic treatment.-—adj. Using or per¬ 
taining to ciphony. 

cltrol, n. The process of converting control and telemetering signals, 
such as those used in missile guidance, into unintelligibility and 
reconverting them into intelligibility through cryptographic treat¬ 
ment.— adj. Using or pertaining to citrol. 
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clviolon, n. Enciphered television. A system of converting television 
signals into unintelligible signals and vice versa, in accordance 
with certain predetermined procedures.---adj. Using or pertaining 
to civision. 

clear text . Plain text. ' 4 

code , n. 1. A code system. 2. A code book.—adj. Pertaining to 

” that which encodes or is encoded. 

* . ' - . - 

code book . A book or document vised in a code system, arranged in syste¬ 
matic form, containing units of plain text of varying length 
(letters, syllables, words, phrases, or sentences) each accompanied 
by one or more arbitrary groups of symbols used as equivalents in 
massages• 

i 

code chart . A chart in the form of a matrix containing letters, syllables, 
numbers, words and, occasionally, phrases. The matrix has row end 
column coordinates for the purpose of designating the plaintext 
elements within. 

code clerk . A clerk who encodes and decodes"messages." 

code group . A group of letters or numbers, or a combination of both, 
assigned (in a code system) to represent a plaintext element. 

code message . A cryptogram produced by encodement. 

code system . A cryptosystem in Which arbitrary groups of symbols represent 
plaintext units of Irregular length, usually syllables, whole words, 

' phrases and sentences. 

code text . The text of a cryptogram which has been produced by means of 
a code system. 

coincidence test . The kappa test. A statistical test applied to two 

ciphertext messages to determine whether they both involve encipher¬ 
ment by the same sequence of cipher alphabets. 

columnar transposition . A method of transposition in which the cipher- 
text equivalent of a message is obtained by transcribing the columns 
of a matrix Into which the message was inscribed earlier according 
to some scheme other than this vertical one. 

column coordinate . A symbol normally at the top of a matrix or crypto¬ 
graphic table, identifying a specific column of cells, used in 
conjunction with a row coordinate to specify an individual cell in 
the matrix or table. Also called column indicator . 

column indicator. See column coordinate. 
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comnmnicabion Intelligence (CQMIMl) . Evaluated and Interpreted Informa¬ 
tion derived from the study of intercepted communications. 

communication security (COMSEC) . The protection resulting from all 
measures designed to deny to unauthorized persons information of 
value which might be derived from communications. Cryptosecurlty , 
transmission security , and physical security are the components of 
comraunication security. 

commutative , odj, As applied to cipher matrices, so constructed as to 
permit coordinates bo bo read in either row-column or column-row 
order without cryptographic ambiguity. 

component , n. One of the two sequences (plain and cipher) which compose 
a cipher alphabet. 

compromise , n. The loss of security of a classified document, information, 
or material, which results from the possibility of an unauthorized 
person or persons having knowledge thereof. 

concealment system . A method of secret communication so designed as to 
convey a secret message without its presence being suspected by 
others than the addressee. In its most usual form, the plaintext 
elements are concealed by combining them with extraneous plaintext 
elements in such a way that the end result is an intelligible and 
apparently innocent message. (Cf. open code .) 

crest , n. In its cryptologic application, a point of high relative fre¬ 
quency in a frequency distribution. 

* 

crib , n. 1* Plain text assumed or known to be present in a cryptogram. 

2. Keys assumed or known to have been used in a cryptogram.—v.t. 

To fit assumed or known plain text or keys into the proper position 
in an encrypted message. 

cross-product test . See chi test . 

cryptanalysis , n. The steps and operations performed in applying the 
principles of cryptanalytics. 

cryptanalyst, n. A person versed in the art of cryptanalysis. 

cryptanalytic , adj. Of, pertaining to, or used in cryptanalytics. 

cryptanalytics , n. That branch of cryptology which deals with the prin¬ 
ciples, methods, and means employed in the solution or analysis of 
cryptosystems• 

cryptanalyze, v.t. To solve by cryptanalysis. 
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cryptogram ,' n. A communication in visible -writing -which conveys no 

intelligible meaning lu any known language, or which conveys some 
meaning other than the real meaning, 

cryptographer , n. One who encrypts or decrypts messages or has a port 
in making a cryptographic system, 

cryptographic , adj. Of, pertaining to, or concerned with cryptography, 

cryptographic ambiguity . Uncertainty as to the method of decryption or 
as to the meaning intended after decryption; created, by a fault in 
the structure of a cryptosystem, 

cryptographic arithmetic . The method of modular arithmetic used in 
cryptographic procedures which involves no carrying in addition 
and no borrowing in subtraction, 

»-'• J ' ■ S *■ « f' 1-i 

cryptographic security . See cryptoseeurlty . 

- , t ■■ « 

cryptographic system . See cryptosystem . 

* - r _» f K " m 

cryptographic text . Encrypted text; the text of a cryptogram, 

** f r 

h ^ 

cryptography, n. That branch of cryptology which treats of the means, 
methods, and apparatus fpr converting or transforming plaintext 
messages into cryptograms, and for reconverting the cryptograms 
Into their original plaintext form by a simple reversal of the 
steps used in their transformation, 

cryptologic, adj. Of, pertaining to, or concerned with cryptology, 

** i 

cryptology, n. That branch of knowledge which treats of hidden, disguised, 
or encrypted communications. It embraces all the means and methods 
of producing communication Intelligence and maintaining communication 
security; for example, cryptology Includes cryptography, crypt- 
analytics, traffic analysis, eto. 

cryptomaterial. n. All documents, devices and machines employed In 
encrypting and decrypting messages, 

eryptomathewatlclan , n. One versed in cryptomathematics. 

oryptomathcmatlcs, n. Those portions of mathematics and those mathematical 
methods'/which have cryptologic applications. 

cryptoseeurlty , n. That component of communication security which 

results from the provision of technically sound cryptographic systems 
and from their proper use. 

cryptosystem , n. The associated ltemc ox oryptomaterlal and the methods 
and rules by which these Items are used as a unit to provide a 
single means of encryption and decryption. A cryptosystem embraoes 
the general cryptosystem and the spocifie keys essential to the 
ewployccient of tne 'general cryptosystem* 
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cyclic, ad$. Periodic; continuing or repeating no that the first term 
of a series follows the last; characterized by a ring or closed- 
chain formation, 

cyclic permutation . Any rearrangement of a sequence of elements which 
rearrangement merely involves shifting all the elements a con/ton 
distance to the right or left of their Initial positions in the 
sequence, the relative order remaining undisturbed; such a 
rearrangement requires that one consider the basic sequence as 
being circular in nature so that, for example, shifting that 
element which occupies the left-most position in the sequence one 
place to the loft places this element in the right-most position, 

dally keying element . That part of the specific key which changes at* 
predetermined intervals, usually dally, 

decimated alphabet . An alphabet produced "by decimation, 

decimation , n. The process of selecting members of a series by counting 
off at a chosen interval, the original series being treated as 
cyclic; or the result of the foregoing process. 

«* # - 

decimation-mixed sequence, A mixed sequence produced by decimation, 

^ t ^ , *■ .Hi 

decipher , v.t. To convert an enciphered message into its equivalent plain 
text by a reversal of the cryptographic ^rohess dbed in encipherment, 
(This does not include solution by cryptanalysis.) 

v 1 lr ■“ » *»' *.•» » /k » i ■* 

r f ’ * 

deciphering alphabet . A cipher alphabet in which the sequence of symbols 
in the cipher component is arranged in normal order for convenience 
in decipherment. 

. .6 m*. t y - „ v’ ' -•, j 

decipherment , n. 1. The process of deciphering. 2. The plain text of 
a deciphered cryptogram. 3* In an enciphered code system, the code 

text resulting from the removal of the encipherment. 

\ 

decode, v.t. To convert an encoded message into its plain text by moans of 
a code book. (This does not include solution by cryptanalysis.) 

—n. 1. That section of a code book in which the code groups are in 
alphabetical, numerical, or other systematic order. 2. The decoded, 
but not translated, version of a code message. 

decodomont , n. 1. The process of decoding. 2. The decoded, but not 
translated, version of a cryptogram. 

j 

decrypt , v.t. To transform an unintelligible or cryptic communication into 
an intelligible one by a reversal of the cryptographic process used in 
enpryptment. (This does not include solution by cryptanalysis.)—n. 

A decrypted, but not translated, message. 

decryption, n. The act of decrypting. 

degnrble , v.t. To make emendations in a garbled text. 
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derived nurnei teal Key * A key produced "by assigning numerical valvies to 
a selected literal key. 

diagnosis, cryptarnlytlc . A systematic deamination of cryptograms with 
a vj .cut to discovering the general system underlying theso crypto¬ 
grams. 

di graph , n. A pair of letters. 

dlgrapblc , adj. Of or pertaining to any combination of two characters. 

r „ I 

dlgrgphlc frequency distribution . A frequency dlstrilwtion of successivo 
palis of letters or characters. A digraphic distribution of ABCDOSF 
would list the pairs: AB, CD, EF, (Cf. bi literal frequency distri ¬ 
bution .) 

digraphic ldiomorph . A plaintext or cipher sequence which contains or 
shows a pattern in its construction as regards the number and 
position of repeated digraphs. 

digraphic substitution . Encipherment by substitution methods in which 

the plsinlcxt units are pairs of characters and their cipher equiva¬ 
lents usually consist of two characters. 

dinewe, n. A pair of digits. 

direct standard cipher alphabet . A cipher alphabet in which both the 
plain and cipher components are the normal sequence, the two com¬ 
ponents being Juxtaposed in any of the non-crashing placements. 

discriminant , n, A group of symbols indicating the specific crypto¬ 
system vised in encrypting a given message. Also called system 
indicator . 

distribution , n. See frequency distribution . 

doublet , n. A double-letter digraph, such os, LL , EE , etc. 

double tianspositlon . A cryptosystem in which the characters of a 
first or primary transposition are subjected to a second trans¬ 
position. 

encicode, n, A portmanteau word for enciphored'code « ' 

encipher , v.t. To convert a plaintext*message into unintelligible 
"language by moans of a cipher system. 

- -> » 

enciphered code . A cryptographic system in which a cipher system is 
applied to encoded text. 
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enciphering alphabet * A cipher alphabet in which the sequence of letters 
in the' ‘ plain" component is arranged in normal order for convenience 
in encipherment* 

encipherment, n* 1. The process of enciphering* 2. Text which has "been 

-"enciphered. “ 

encoded cipher * The final text produced by enciphering the plain text and 
then encoding the enciphered text. 

encode, v.t. To convert a plaintext message into unintelligible language'' 
by means of a code book,—-n. That section of a code book in which 
the plaintext equivalents of the code groups are in alphabetical, 
numerical, or other systematic order* 

oneodement , n. 1. The act or process of encrypting plain text with a 
code system. 2* The text produced by encoding plain text* 

encrypt , v.t. To convert a plaintext message into unintelligible language 
by means of a cryptosystem. 

encrypted text . The text produced by the application of a cryptosystem to 
a plaintext message. 

encryption , n. l. The act of encrypting, 2. Encrypted text. > 

external text . In concealment systems, the apparently innocent enveloping 
text within which a secret message is hidden. 

four-level dlnome cipher . A biliteral substitution cipher system employ¬ 
ing four cipher sequences composed of two-digit numbers, by means > K 
of which all or nearly all of the plaintext letters are provided 
with four two-digit variant equivalents." 

four-square matrix system . A digraphic substitution system employing a 
matrix which" usually consists of four 5x5 squares in which the 
letters of 25 -element alphabets (usually combining I and j) are 
Inserted according to any prearranged order. ^ 

r , - rn t + - — m 

fractionating system . A cipher system injwhich plaintext units are . 
represented by two or more'cipher symbols which in turn are dis¬ 
sociated and subjected to further encipherment by substitution or 
transposition or both. ^ ^ 

_ i. 

f ractionation , n. A cryptographic process wherein the cipher symbols, 
which combined represent a plaintext unit, are dissociated and sub¬ 
jected to further encipherment. 

- i * k ^ - * 
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frequency distribution * A tabulation of Ihe frequency of occurrence 
of plaintext oi* ciphertext units in a message or a group of 
messages• A frequency couni• 

frequent" lal matrix . A type of cipher matrix providing ■variants. A 
matrix In which the number of different cipher values available 
to represent any given plaintext letter closely approximates lbs 
relative plaintext frequency. 

garble , n. An error in tranemission, reception, encryption, or de¬ 
cryption "which renders incorrect or undecryptahle a message or 
transmission or a portion thereof •—v.t. To note an error in 
transmission, reception, encryption, or decryption of a message. 

general cryptosystem . The basic invariable method of encryption 
included in a cryptosystem, excluding the specific keys 
essential to its^ employment. 

- * ~ » * 

generatrix , n. In connection vith the method of completing the 
plain component sequence, any one of the rows, each of which 
represents a trial "decipherment 11 of the original cryptogram. 

Grandpre cipher . A type of "substitution system providing variants. 
Tliis system employs a cipher square in which are inscribed ten 
10-letter words containing all the letters of the alphabet in 
their eiproximate plaintext frequencies. These ten words are 
further linked together by a 10-letter word which appears 
vertically in the first' column ss a mnemonic feature for the 
inscription of the words in the rows. 

grid, n. In a transposition system, a form or matrix over which a 
gyille is placed for the purpose of enciphering or deciphering. 

grille , n. 1. A sheet of paper, cardboard, thin metal, plastic, or 
like material in which perforations have been made for the uncov¬ 
ering of spaces in which textual units may be written or read 
on the grid. 2. A matrix in which certain squares are blocked 
out or otherwise marked so as not to be used. 

group , n. A number of digits, letters or characters forming a unit 
for transmission or for cryptographic treatment. 

high-echelon , adj. Pertaining to organizational units at the army 
divisional level or higher, or their equivalents, 

high-grade , adj. Pertaining to a cryptosystem which offers a maximum 
of resistance to cryptanalysis; for example: (l) complex cipher 
machines, (2) one-time systems, (3) two-part codes enciphered 
with an additive book. (Cf. low-grade and medium-grade.) 
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H113 *s algebraic encipherment . A true polygraphic system for the encipher¬ 
ment of polygraphs of any order, involving algebraic treatment by 
means of coefficients for the transformation of a plaintext poly¬ 
graph into its ciphertext polygraphic equivalent, and vice versa. 
Invented by Professor Lester S. Hill of Hunter College. 

hit, n. A coincidence or identity . 

horizontal two-square matrix system . A digraphic substitution system em- 
ploying a! matrix which normally consists of two 5 x 5 squares placed, 
side by side. 

W — * _ 

identification, n. Determination of the plaintext meaning^ of a cipher 
element or code group. “ ( r r n _ 

identify , v.t. To determine the plaintext meaning of a cipher element or 
code group. , ",.. . - ’ 

ldiomorph , n, A plaintext or cipher sequence which contains or shows a 
pattern in its construction as regards the number at^d positions of 
repeated letters. ' __ 

“ N» - . jj 

idlomorphism, n. In a plaintext or' cipher sequence, the" phenomenon of 
showing a pattern as regards the number and. positions of repeated 
letters. ‘ 1 ’ ~ ' pu 

r * T ^ 

index of coincidence . The ratio of the observed number of coincidences in 
a given cryptogram to the number of coincidences expected in a sample 
of random text of the same size as the cryptogram. 

Indicator, n. In cryptography, an element Inserted within the text or 
heading of a message which serves as a guide to the selection or 
derivation and application of the correct system and key for 
the prompt decryption of the message. See also the more precise 
terms discriminant and message indicator . 

inscription , n. In a transposition system, the process of writing a mes¬ 
sage into a matrix. 

Integer, n. A whole number. 

intercept, v.t. In its cryptologic application, to gain possession of 
communications which are intended for other recipients, without 
obtaining the consent of the addressees and without preventing or 
ordinarily delaying the transmission of the communications to those 
addressees.—n. A copy of a message obtained by Interception. 

interception, n. The process of gaining possession of communications in¬ 
tended for others without obtaining tho consent of the addressees 
and without preventing or ordinarily delaying tho transmission of 
the communications to those addressees. 
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internal text * In concealment systems, the secret text vhlch Is envel¬ 
oped by open or apparently innocent text. 

international teleprinter code . See Baudot code, 

interrupted-key columnar transposition . A columnar transposition system 
in which the plaintext elements are inscribed in a matrix in rows of 
irregular length as determined by a numerical key. 

inverse four-square matrix system . A four-square matrix system in which 

the cipher sections contain normal alphabets while the plain component 
sections contain mixed alphabets. 

Invisible writing . Writing not visible to the naked eye; the characters 
composing suich writing may be microscopic or inscribed with invisible 
ink. 

Isolog, n. A cryptogram of which the plain text is identical with that of 
another message encrypted in another system, key, code, etc. 

Isologous, adj. Pertaining to or having the nature of an Isolog. 

Jefferson cipher . A polyalphabetlc substitution system Invented by Thomas 
Jefferson and independently at a later date by the French cryptographer 
Bazeries. It provided for encipherment by means of a manually operated 
device Involving a number of revolvable disks, each bearing a mixed 
alphabet on its periphery. 

kappa plain constant . A constant employed in coincidence tests to denote 
the probability of coincidence of a given textual element or unit In 
plain text. It is the sum of the squares of the probabilities of 
occurrence of the different textual elements or units as they are 
employed in writing plain text; for example, in English telegraphic 
plain text, the monographic and digraphic kappa plain constants are 
♦0667 and . 006 $ respectively. 

kappa random constant . A constant employed in coincidence tests to denote 
the probability of coincidence of a given textual element or unit in 
random text. It is merely the reciprocal of the number of different 
elements or units of which the cipher text may have been composed; 
if a 26 -letter alphabet were employed, for Instance, the constant 
denoting the probability of coincidence of various textual elements 
wound be derived as follows: 

a. single letters 1/26 ■ .0385 

b. digraphs 1/676 * .00148 

c. trigraphs l/l7576 * .OOOO 57 

kappa test . See coincidence test . 

key, n. 1. In cryptography, a symbol or sequence of symbols applied to 
successive textual elements of a message to accomplish their encryp¬ 
tion or decryption. 2. A specific key. 
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key "book* A book containing key text, or plain text forming specific keys. 

— - *r 

keyed columnar transposition . A transposition system In which the columns 
of a matrix ore token off In the order determined by the specific key, 
which Is often a derived numerical key. * _ n - 

key phrase . An arbitrarily selected phrase from which a key is derived. 

n * * * i 

key recovery . The cryptanalytic reconstruction of a key. 

key text . Text from which key is derived. - 

* r«. 

key word . An arbitrarily selected word used as a key per se , or from which 
a key is derived. 

f K r * -* r» f t-S* 

keyword-mixed alphabet . An alphabet constructed by writing the prearranged 
key word or key phrase (repeated letters, if present, usually being 
omitted after their first occurrence), and then completing the se¬ 
quence from the unused letters of the alphabet in their normal sequence, 

I -*’■ * " * P 4 * I 

lambda (A) test . A test for monoalphabeticity in a message, based on a 
comparison of the observed number of blanks in its frcqufefccy distri¬ 
bution with the theoretically expected number of blanks both xn 
(a) a normal plaintext message of equal length and (b) a random 
assortment of an equal number of letters. Also called the blank- 
expectation test . "• t »V'V rr~ 

t i 4 j_ l ul ' ,t,«t '* | **■ 

latent repetition . A plaintext repetition not apparent in cipher text 
L but susceptible of being made patent as a result of analysis. 

■ *■ -* ■*1 I r*r - , r 1 T ’Vif f 

Latin square . A cipher square in which no row nOr column contains a 

repeated symbol. - ■ *-'»»-* ' ->.» *‘ - 

*4 i ^ >, i i/ 4 if i 

lexical , adj. Of, pertaining to, or connected with words • In its crypto¬ 
logic sense, the word is used to characterize those cryptographic 
methods (chiefly codes) which deal with plaintext elements comprising 
complete words, phrases and sentences. 

r - _ * -ft Jt * yj I mr *• 61 - 1 ” ■* P ^ 

literal key . A key composed of a sequence of letters. 

1 r m V«j j t- f- — j *|* 

logarithmic weights ♦ numerical weights assigned to units of plain text, 
which weights are actually logarithms of the probabilities of the 
plaintext units, and which are used to evaluate the results of certain 
cryptanalytic operations. #• “- * *- ' nr 

low-echelon , adj. Pertaining to organizational units below the level of 
the army division or its equivalent. 

low-grade , adj. Pertaining to a cryptosystem which offers only slight 
resistance to cryptanalysis; for examples (l) Playfair ciphers, 

(2) single transposition, (3) unenciphered one-part codes. (Cf. 
medium-grade and high-grade). 




T -f 
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matrix, n. A geometric form or pattern. In tianspositton systems, 
the figure or fliagrom in which the varies steps of the trans¬ 
position ore effected; in substitution systems, the figure or 
diagram containing the sequence or sequences of plaintext or 
cipher symbols. 

medium-grade, adj. Pertaining to a cryptosystem which offers con¬ 
siderable resistance to cryptanalysis; for example: (l) strip 
ciphers, (2) polyphase transposition, (3) unenciphered two- 
part codes. (Cf. low-grade jand high-grade ). 

message, n. Any thought or idea expressed In plain or secret lan- 
gunge, prepared in a form suitable for transmission by any 
means of communication. 

message Indicator . That part of the specific key which changes with 
every message. 

message keying element . See message indicator . 

mixed cipher alphabet . A cipher alphabet In which the sequence of 
letters or characters in one or both of the components Is not 
the normal sequence. 

mixed-length system . A cryptosystem in which the units of cipher 
text or code text are of Irregular or non-constant length, as 
for example, a znonome-dlnome system, or a code system employ¬ 
ing both k -letter and 5-letter groups. 

mnemonic key . A key so constructed as to be easily remembered. 

modulo , adj. Pertaining to a cyclic scale or basis of arithmetic. 
(Abbreviated as mod ; e.g., mod 10, mod 26, etc.) 

modulus , n. Scale or basis of arithmetic; the number n is called 
the modulus when all numbers which differ from each other by 
n or a multiple of n are considered equivalent. 

monitor, v.t. To intercept and copy one’s own or friendly radio and 
wire transmissions for the purpose of detecting and correcting 
violations of regulations. 

- JL 

monoalphabeticity , n. A characteristic of encrypted text which indi- 
cates that" it has been produced by methods involving a single 
Cipher alphabet or single code book, unenciphered. It is 
normally disclosed by frequency distributions which display 
"roughness", or pronounced variation in relative frequencies. 
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fflcmortlp ho bo fclg. _sxtbstituhion , A type Of substitution 'employing O oinKl 0 

ciph<n* alphabet 'by irrcans of which each cipher equivalent, 00wpan?9.^ c "' 
of one or icoro clcmenbs, invariably represents one particular plaia- 
text utii b , vhei*over it occurs throughoxxt any given message * 

f 

monographic* adj, of or pertaining to any units comprising single charac¬ 
ters » 

k 

monogtaphlc substitution » Encipherment hy substitution methods ia wh£ch 
the plaintext xuxlts are Bingle characters and their cipher equiva- 
lents usually consist of single characters* 

TOonomc * n* A single digit* *" * " 1 

monomc^dinome system * A substitution system in which certain plaintext 
elements' have single-digit cipher equivalents, while Others are _ 
represented by pairs of digits. 

multiliteral , adj* of or pertaining only to cryptosystems, cipher alpha- 

rrrr ' *' Vcl s"/ and frequency distributions which involve cipher units of two 
or more letters or characters *_ _S.ee the more inclusive term poly - 
foaphic « 

it K i ifa— a ^ 

multiliteral cipher alphabet « A cipher alphabet In which one plaintext 

letter is repreVented b'y cipher units comprising two or more elements« 

multiliteral system * A substitution system involving one or more multi- 
liter al cipher alphabets, 

4 

multiple alphabet system , A type of substitution in which successive 

lengthy port ions’ of a message are each mo no alphabetically enciphered 
by a different alphabet; monoaiphabetic encipherment by sections, 

non-carrying bxto , A sxm produced in cryptographic (mod 10) arithmetic* 

non-crashing * adj* A term xxsed to describe that feature of the structure 
of cex*tain cryptosystems which does not permit a plaintext unit to be 
self-enciphered, 

- ^ t 

non-coraiutative * adj. As applied to bipartite cipher matrices, so con¬ 
structed that row and colximn coordinates must be read in a certain 
prescribed order (for example, in a row-column order), 

normal frequency . The standard frequency of a plaintext unit or letter 
relative to other sxich units or letters, as disclosed by’the statis¬ 
tical study of a large volume of text, J - 

normal sequence . The normal alphabetical sequence of those letters which 
' 'are' used in the written text of any particular language, or any cyclic 
permutation thereof. 
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normal unlllteral frequency distribution . A distribution showing 
the standard rointivc frequency of single plaintext symbols as 
disclosed by statistical study of a large ■volume of text. 

null, n. In cryptography, a symbol or unit of encrypted text hewing 
no plaintext significance. 

numerical key . A key composed of a sequence of numbers. 

numerically-keyed columnar transposition . A transposition system in 

. . . 1 ■-ph'jich tlie^co'iumns of a matrix are taken off in the order deter¬ 
mined by a numerical key. 

Off the cut . As .applied to the division of cipher text into poly-' 
graphs, beginning elsewhere than with the initial character of 
a bona fide polygraph. 

•f » * * ^ 

one-part code . A code in which the plaintext elements arc arranged 
in alphabetical or numerical order accompanied by their code 
groups also arranged in alphabetical or numerical order. 

one-time pad . A form" of key book used in a one-time system so de¬ 
signed as to permit the destruction of each page of key as 
soon as it has been used. 

one-tire system . A cryptosystem in which the key, normally of a 
rondom nature, is used only once. 

on the cut . As applied to the division of text into polygraphs, be¬ 
ginning Tilth the first textual character. 

open code . A cryptosystem in which units of plain text are used as 
the code equivalents for letters, numbers, words, phrases or 
sentences. The code equivalents themselves, usually words or 
phrases, can be combined to form the intelligible text of 
apparently innocent messages. (Cf. concealment system .) 

originator, n. The individual (a commander or his officially desig¬ 
nated representative) by whose authority a message is sent. 

padding , n. Extraneous text added to a message for the purpose of 
concealing its length and beginning or ending or both. 

paraphrase , v.t. To change the phraseology of a message without 
changing its meaning. 

partially-polygranhic system . Any polygraphic substitution system 
in which the encipherment of certain members of the poly¬ 
graphs shows group relationships; small matrix systems, such 
as the four-square, two-square and Playfair sybt- 13 involve 
such group relationships end are considered to 1 e portially- 
digrapliic syct^s. 
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pari 11Ion , n. Resolution of an i nieger Jwto a sot of integers (e.g«, 
rtpresentatIon of the integer 6 as 1 and 5> 2 and 4, 3 and 3)• 

“ "■ # n A mr » 

»r 

patent repetition . A repetition which is externally visible in the 
original cryptographic text. 

( - r-J If * r f | 1 i r * ""*■ I*\ ^ f ^ 

ffont o graph , n. A set of five letters. " * cr 

pent arowe , n. A set of five digits, ' 1 ** 


periodic siibstitutio n. Periodic polyalphabctlc substitution. A method 

of cncipEcrjuonl involving the cyclic use of a plurality of alphabets. ^ ^ 

m ■*“ » *• • 

permutation table . A tablo designed for the systematic construction of 
code groups. It may also be used to comect garbles in groups of 
code text. * * - J *' ~ ' T>fet r^rr •_,» ? • „ * * 

r “' |j'" 

phi ( $) toot. A test allied to a frequency distribution to dotermixip'i^bs * 
relative monoalphaboticity. See also happa plain constant and* kayp a^ 
random constant. ' * * 


—P * I- 


phy sic al securi ty. That component of communication security which results 
~ ~frbixi all physical measures necessary to safeguard”Classified contorani- 
cation equipment and material from access thereto by unauthorized 
persons. f 


placofl e, n. 
code. 


A portmanteau word used to elosignate plain or untnclphercd 

-». *1 T ^ A * 


p lain code • Unenciphered code * 


^ j- 




A V > 


That component of a cipher alphabet which comprises the 


sequence of plaintext symbols. 


plain compo ne nt eq u ivalents . In connection with the method of completing 
the plain coBponenThcquence, the plaintext equivalents for cipher 
units derived from an arbitrary Juxtaposition of the components of a 
cipher alphabet. 


plain language . Plain text. ' - < 


plain tex t ( clear tea t). 1, Text or language which conveys an intelligible 
rti£.oning in tSe^languagc in which it is written, with no hidden meaning. 
2. The intelligible text underlying a cryptogram. 

! , f I~ ^ - V \ r 1 f *■ 


sy stem . A type of digraphic substitution using a single matrix 
hornnily of 25 cells. 
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Poisson table . Table of the Poisson distribution. A type of math- 
cnAtical table containing probability data applicable to the 
phenomena of repetitions expected to obtain in samples of random, 
text; used in cryptanalysis to determine whether or not the 
repetitions observed in a given sample of cryptographic text are 
causal repetitions or accidental (random) repetitions. 

polyolphabotic substitution / A type of substitution in which the 
successive plaintext elements of a message, usually single 
letters, are enciphered by a succession of different alphabets 
which may bo used more than once and which care used in a 
predetermined order. 

polygreThic , ad3« Of, pertaining to, or connected with any group¬ 
ings comprising two or nore letters or characters. 

** 

polygranhic substitution . Encipherment by substitution methods in 
which* the plaintext units are regular length groupings of more 
than one element. 


polyphase cnciphcrrent . Any system of encryption involving two or 
more successive operations of encipherment• 

probable word . Plain text assumed or known to he present in a crypto¬ 
gram. A crib. 

probable-*; rord method . The method of solution involving the trial of 
plain text assumed to be present in a cryptogram. 

- i 

proforma message . A message in standardised form, designed to convey* 
intelligence by conventions of arrangement and abbreviation. 


pseudo-code system . A cipher system which produces a cryptogram whose 
groups resemble those produced by a code system. 

pseuAo-polyftcrphic system . A polygraphia substitution system in Which 
at least oaeofuHe letters in each polygraph is enciphered mono- 
aJLphabetically. ~ ~ 


quinqueliteral alphabet . A cipher alphabet in which each plaintext 
fetter Is. represented by a 5-character equivalent. 

1 ^ / ^ jrXi- to * * * ' i , 

random-mixed cipher alphabet . A cipher alphabet in which the letters 

comprising the' plain or cipher component have been mixed at random. 
(Cf. systematically-mixed cipher alphabet ). 

random text . Text which appears to have been produced by chance or 
accident, hsvipg no discernible patterns or limitations. 
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rapid analytical machinery . Any high-speed cryptanalyblc machinery, 

usually*electronic or photoelectric in nature. * ' ~ ” * 

r, r t. — 

> 

raw traffic . Intercepted traffic showing no evidence of processing for 
communication intelligence purposes beyond sorting by dear address 
elements, elimination of unwanted messages, and the inclusion 6 $ a 
case number end/or an arbitrary traffic designator. 

reciprocal cipher alphabet . A cipher alphabet in which either of the tyo 
sequences may serve as plain or cipher since the equivalents e^jiibit 
reciprocity. 

reciprocity , n. As used in cryptology, interchangeability of plain-cipher 
relationships (e.g., Ap = B c and Bp = Ac). 

' ' 1 • * " - r 

related alphabets . Any of the several secondary cipher alphabets which 
are produced by sliding any given pair of primary components against 
each other. 

relative code . Code text from which an encipherment has been removed in 
relative terms but not reduced to plain-code text, so that the groups 
differ from the actual, original plain code by an interval constant 
for every group; thus the difference between two relative code groups 
is the some as that between their plain-code equivalents. 

repeating-key method . See periodic substitution . 

repetitive encipherme nt. A typo of encipherment in which the primary cipher 
texFof a cryptogram is subjected to further encipherment with either 
the seme or a different system. Double transposition is a frequently- 
encountered example of repetitive encipherment* > 

rever s ed standard cipher alphabet . A cipher alphabet in which both the 
""plain and cipher components are the nbnra.1 sequence, the cipher com¬ 
ponent being reversed in direction from the plain fcdmjionent. 

reversibility , n. That characteristic of the relationship het^repn a plain¬ 
text' digraph and its cipher digraph equivalent which permits the 
elements of each to be reversed (e.g., ABp • CDc and BAp •• DC c ). 

revolving grille . A type of Grille in which the apertures are so distri¬ 
buted that when the grille is turned successively through four angles 
of 90 degrees and set in position on the grid, all the cells on the 

grid are disclosed only once. Also called rotating grille.* 

'' ' 1 " m i ’ 

rotating grille . See revolving grille . : 

rotor, n. A disk vhich is designed to rotate within a cipher machine and 
which controls the action of some other machine component or produces 
a variation in some textual or keying element. 
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roughness , n. That characteristic of a frequency distribution where 
"There is displayed in the distribution a pronounced v^n-tlon in 
relative frequencies of the elements considered,(Cf. s )othncao ,) 

route transp o si-Hon . A method of transposition in which the cipher- 
text equivalent of a message is obtained by transcribing, accord¬ 
ing to any prearranged route, the cells of a matrix into rhich 
the xrossege was inscribed earlier according to sore other pre¬ 
arranged route. 

row coordinate . A symbol normally at the sade of a matrix or crypto¬ 
graphic table, identifying a specific row of colls, used in con¬ 
junction with a column coordinate to specify an individual cell 
in the matrix or table. Also called row lndicctor , 

row Indicator . See row coordinate . 

running dinore distribution. A biliteral distribution nede on digit 
text. 

secret ink . Any of several chemicals used for writing or printing which 
hove the property cf being initially invisible to the n^hed eye or 
of becoming so after a short time. Also called invisible ink or 
sy pathetic ink . 

secret jo^nwage . Text which conveys no intelligible meaning in any 
l^gu-ge or which conveys an intelligible meaning that is not 
the real, hidden meaning. 

secret writing . 1. Visible writing in secret language. 2. Invisible 
writing. 

separator, n. See word separator . 

sequence, n. " An ordered arrangement of symbols (letters, digits, etc.) 
having continuity. Specifically, the members of a component of a 
cipher alphabet in order; the symbols 5n a row, column, or diagonal 
of a cipher square in order; key letters or key figures in order. 

r 

setting, n. The arrangement and alignment of the variable elements of 
a cryptographic device or machine at any moment during its opera¬ 
tion, ' ’ - 

slprr'^ e.n. As used in cryptomathematics, a measure of the standard 
deviation from normal, expressed in terms of sigma (<5*). 

simple substitution . Monoalphabetic unillteral substitution. 

simple transposition . See single transposition . 
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fc ' c ^ 

single transposition . A transposition In ■which only one Inscription and 

one transcription ore effected, “ - _ 

« lj; 

smoothness, n. That characteristic of a frequency distribution where there 
is displayed in the distribution no pronounced variation in relative 
frequencies of the elements considered* (Cf• roughness ) 

solution, n. In its cryptanalytic application, the process or result of 
solving a cryptogram or cryptosystem by cryptanalysis, 

solve, v,t. To cryptanalyze. To find the plain text of encrypted communi¬ 
cations by cryptanalytic processes, or to recover by analysis the keys 
and the principles of their application, 

specific key . An element which is used with a specific cryptosystem to 
determine the encipherment of a message and which includes both the 
me ssage keying element and the daily keying element . It may consist 
of a letter, number, word, phrase, sentence, a special document, book, 
or table, etc., usually of a variable nature and easily changeable 
at the will of the correspondents, or prearranged for them or for 
their agents by higher authority, 

% h 

square, n. See matrix . 

standard cipher alphabet . A cipher alphabet in which the sequence of 
letters in the plain component is the normal, and in the cipher 
component is the same as the normal, but either reversed in direction 
or shifted from its normal point of coincidence with the plain com¬ 
ponent. 

standard uniliteral frequency distribution . See normal unillteral fre ¬ 
quency distribution . 

stercoty ne, n. A word, number, phrase, abbreviation, etc., which as a 
result of language habits, hen a high probability of occurrence, 
especially at the beginning or ending of a massage. 

- t 

stereotyped messages . Related encrypted messages which are recognizable 
as such because of distinctive characteristics of the underlying 
plaintext. *»- „ 

strip ci rb' r device . A cipher device employing sliding alphabet strips. 

substitution alphabet . See cipher alphabet . 

r * r "“ *" ^ ii f ^ i 

substitution cipher . 1. A cipher system in which the elements of the plain 
text are replaced by other elements, 2. A cryptogram produced by 
enciphering a plaintext message with a substitution system. 

* - - • i T- 

substitution system . A system in which the elements of the plain or code 
text ore replaced by other elements. 
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svi-cI^^ViT ^ ri i■ a , . A prose! cct/:d digit (normally the final digit) 
in - cc ** 01 oil licr guoup \ htch i*j the no”-carrying even of 
tli'' other digits an the group. 

suBnirg-trircro *n -m . A substitution system In which crch pi -In- 
text letter I'. - -olcrcd. *» unique nr ertcalvc”’O of 0 to ??. 
Thas voluo is then expressed as a trinone, tiiO digits of winch 
sum to the designated, value of the letter. 

r 

swporenel pliermont, n. A form of nuporencryptlon. in which the final 
step involves cnciphcrrent. —v„t, Pup^rorcipher . 

supcrencryptaon, n. A further encryption of the text of a c^pfco- 
grnn for increased security. Enciphered code is a ftxqu 'ntly- 
encountercd example of superencryption. —v .t. Sun' > r~ncrynt . 

switch croup . A group used within a message to indicate that the 
following textual elements are encrypted with a different key 
or code hook. 


syllabary, n. In a code took, a list of individual letters, combi¬ 
nations of letters, or syllables, scconpenicd by their equiva¬ 
lent code group's, usually provided for spelling out -word3 or 
proper names not present in the vocabulary of a code; a spell¬ 
ing table. 


syllabary square . A cipher matrix containing individual letters, 
digits, syllables, frequent digraphs, trigraphs, etc., which 
are encrypted by the row and column coordinates of the matrix. 


syllabl e, adj. Of, per L'In In g to, or denoting syllables. 

— l 4 


system, n. See cryptosystem . 


system t leal ly-mixcd cipher glphrbet . A cipher alphabet in which 
the corponent - tiiat is maxed Thao been disarranged by systematic 
procedure. (Cf. rpi»flom-iri>ed cipher alphabet ) - 


system indicator . See discrjnnnont . 

r- - 

teleprinter, n. An electrically-operated instrument resenJbling a 
typewriter, used lor the transmission and reception-printing 
of messages by electrical means. Also called teletypewriter . 

„ * 

teletypewriter , n. A teleprinter . 

' 6 * . 

tetragreph, n. A set of four letters, 
tetranome, n. A set of four digits. 
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igriy n. She part of a message containing the basic infannaxlon which the 
originator desires to ho coimamjcotcd. * 

. 

traffic, n. All transmitted connnmications. 

traffic analysis * That trench of cryptology which, through a "study oF 
signal transmissions hy all means short of cryptanalysis of message 
textb, assedbles information coneoming cotimuxucation networks* This 
information Is used (l) as a guide to further Interception; (2) as an 
aid. to cryptanalysis; (3) am a source of Intelligence even In the 
absence of decrypted, message texts; and. (4) to strengthen our own 
security by discovering weaknesses in our communications end by 
avoiding weaknesses discovered m the communications of others* . 

traffic Intercept . A copy of a communication obtained, through Interception* 

transcription, n. In fe, transposition system, the process of removing the, 
text from a matrix or grid by a method or route different from that * 
used In the Inscription* * » 

transmission security . That component of communication security which 
results' from al3 measures designed to protect transmissions from 
interception end traffic analysis* . . it..* * * 

' » 11 ^ r •• F 

transparency, direct . That characteristic of cipher text which Indicates 
That certain plaintext elements may have been self-enciphered* 

_ - -1 « “■ rti» ■■ t . |. 

transparency, inverse. That characteristic of cipher text which Indicates 
that certain cipher digraphs, may be merely reversals of the correspond¬ 
ing plaintext digraphs* 

n c * ♦ " e - • [ l 

transposition cipher . 1* A transposition system. 2* A cryptogram pro¬ 
duced by enciphering a message with a transposition system* _ 

trsnspositicn-mlxed cipher alphabet . A cipher alphabot In which at least 
one ccsiponcnT (plain or clphet) has been constructed by‘applying a 
fomn of transposition to either a standard or a mixed sequence* 


transposition system . A cryptosystem in whlch_ toe_e_lements of plain text^ 

whoxLer individual letters, groups of letters, 'syllables, words, phrases, 
sentences, or code groups or their components undergo same Change In 
tholr xelative positions without a change in their Identities* • 

•v Li 1 w m ^ * 1 *■** 

trigraph, n* A Bet of three letters* 

^ _ *. ^ - j- f- 

tiigronhic, adj. Of or pertaining to any three-character group. 

1 “ 1 ' ' ' • 'i 11 

hir^g phic f ienuency distribution . A frequency dlsteibution of successive 
"‘Trjcrui* graphic frequency distribution of AECECF would, qgn- . 

sjder crly ■'13crophs A3£ end 1TF * (OC. trjla t oivl l ioo ucrcy distrl- 

T- ul ion ) 
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trigraphic substitution system . A substitution system In which the 
plaintext units ore composed of three elements. 

trilitcral, a&j, Of, or pertaining only to cryptosystems, cipher 
alphabets, and' frequency fiisti’ioutions which involve .ci pher 
units of three letters or characters. See the more inclusive 
term trlgraphic ; see also triliteral frequency distribution . 

triliteral frequency distribution . A distribution of the characters 
in the text of a message in sets of three, which will shows 
(a) each character with lt3 two preceding characters or (b) 
each character with its two succeeding characters, or, in its 
most usual form, (c) each character with one preceding and one 
succeeding character* A triliteral frequency distribution of 
A3BCPKF would consider the groups ABC, BCD , CDS, DSF. 

trinome, n. A set of three digits. 

trinoroo-digraphic system . A substitution system in which plain¬ 
text digraphs are represented by 3-digit cipher elements. 

trough, n. In its cryptologic application, a point of low relative 
frequency in a frequency distribution. 

true polygraphic system . Any polygraphic substitution system in which 
the individual elements of the polygraphs display no evidence 
of monoalphabeticity, nor evidence of relationships within any 
group of polygraphs; that is, in a true polygraphic system, chang 
ing one letter in any plaintext polygraph affects the equivalent 
ciphertext polygraph in its entirety, (Cf. partially-polygraphic* 
system and psev do-polygraphic system. ) 

two-element differential . The characteristic incorporated in certain 
codes in which the groups differ from one another by a minimum 
of two elements, either in identity or the positions occupied. 

•. When the elements ape letters, the characteristic is called a 
two-letter differential ; when the, elements are digits, it is 
called a two-digit differential . 

two-part code. A randomized code;, consisting of an encoding section 
‘Which the plaintext groups are arranged in alphabetical or 
other significant order accctopanicd by their code groups arranged 
in. a non-alphabetical or random order; and a decoding section, 
in Which the code groups are arranged in alphabetical or numeri¬ 
cal order and are accompanied by their meanings as given in the 
encoding section. 

two-square matrix system. A digraphic substitution system which 
nornlly enploys "a matrix consisting of two 5x5 squares 
arranged either horizontally or vertically. 
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uniliteral , adj. Of, or pertaining only to cryptosystems, dither alphabets, 
and frequency distributions which involve cipher units of single* ' 
letters or characters• See the more inclusive term monographic ; see 
also uniliteral frequency distribution . 

* r-f' ■_ j '■ 

unlllteral frequency distribution « A simple tabulation showing the fre¬ 
quency "of^in^’ivI3ual"‘ciiiaracters of a text* 

unlllteral substitution * A cryptographic process in which the individual 
letters of a message text are replaced by single-letter cipher 
equivalents. 

variant , n. 1* One of two or more cipher or code symbols which hove the 
same plain equivalent; also called variant value * 2* One of several 
plaintext meanings which may be represented by a single code group* 

variant system * A substitution system in vhich some or all plaintext 
letters may be represented by more than one cipher equivalent * 

variant value . See variant . 

vertical two-square matrix system . A digraphic substitution system employ- 
ing a matrix which normally consists of two 5*5 squares arranged 
vertically* 

Vlgenere square * The cipher square commonly attributed in cryptographic 
literature to the French cryptographer Vlgen&re, having the normal 
sequence at the top (or bottom) and at the left (or right), with 
cyclic permutations of the normal or other sequence forming the suc¬ 
cessive rows (or columns) within the square* 

visible writing * Writing in which the characters are inscribed with ordi- 
nary writing materials and can be seen with the naked eye. (Cf* invis ¬ 
ible writing .) * { . 

Wheatstone cipher device * A cipher device consisting essentially of two 
rings mounted concentrically in! a single plane, .the outer (and larger) 
ring being the plain component of the device and comprising 27 equl- 
slzed divisions, the inner (and smaller) ring being the cipher com¬ 
ponent, comprising 2 6 similar divisions* The device incorporates two 
hands (similar to those on a dock) pivoted at the center of the de- . 
vice—the larger hand serving the outer ring and the smaller hand, 
the inner—so geared together that for each complete revolution of the 
larger, the smaller turns through one complete revolution plus one 
twenty-sixth* 

word pattern . The characteristic arrangement of repeated letters in a word 
which tends to make it readily identifiable when enciphered monoalpha- 
betically. 
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word separator , A unit of one or acre characters employed in certain 
cryptosystems to Indicate the space between words« It may be ' 
enciphered or unenciphered* Also called a word spacer * 

word transposition * A cryptosystem in Which whole words are trans¬ 
posed according to a certalp pre a rranged route or pattern* 
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ENGLISH CRYPTANALYTIG DATA 
FREQUENCY TABLES 

l*kbl« No Paso 

1-A Absolute frequencies of letters appearing in five sets of Governmental plain-text telegrams, each set 

containing 10,000 letters, arranged alphabetically____ 4 

1-B Absolute frequencies of letters appearing m five sets of Governmental plain-text telegrams, each set 

containing 10,000 letters, arranged according to frequency____._ 6 

1- C Absolute frequencies of vowels, high-frequency consonants, medium-frequency consonants, and low- 

frequency consonants appearing m five sets of Governmental plain-text telegrams, each set con¬ 
taining 10,000 letters. 5 

2- A Absolute frequencies of letters appearing m the combined five sets of messages totalling 50,000 letters, 

arranged alphabetically__-_____ 6 

2-B Absolute frequencies of letters appearing m the combined five sets of messages totalling 50,000 letters, 

arranged according to frequency__ 6 

2-C Absolute frequencies of vowels, high-frequency consonants, medium-frequency consonants, and low- 

frequency consonants appearing m the combined five sets of messages totalling 50,000 letters_ 6 

2-D Absolute frequencies of letters as initial letters of 10,000 words found m Governmental plain-text tele¬ 
grams (1) Arranged alphabetically and (2) arranged according to frequency___ 6 

2-E Absolute frequencies of letters as final letters oi 10,000 words found m Governmental plain-text tele¬ 
grams (1) Arranged alphabetically and (2) arranged according to frequency_ 7 

8 Relative frequencies of letters appearing in 1,000 letters based upon Table 2-B (1) Arranged alpha¬ 

betically, (2) arranged according to frequency, (3) vowels, (4) high-frequency consonants, (5) me¬ 
dium-frequency consonants, and (6) low-frequency consonants_ 7-8 

4 Frequency distribution for 10,000 letters of literary English, as compiled by Hitt (1) Arranged 

alphabetically and (2) arranged according to frequency__—... 8 

5 Frequency distribution for 10,000 letters of telegraphic English, as compiled by Hitt (1) Arranged 

alphabetically and (2) arranged according to frequency_-_ 8 

6-A Frequency distribution of digraphs, based on 60,000 letters of Governmental plain-text telegrams, 

reduced to 6,000 digraphs_._ 9 

6- B Frequency distribution of digraphs (naval text) based on 20,000 letters of naval text, reduced to 

2,000 digraphs......... 10 

7- 11 Absolute frequencies of digraphs, tngraphs, and tetragrapha and the logarithms of their as¬ 

signed probabilities___11-88 

7-A The 428 different digraphs of Table 6-A, arranged according to their absolute frequencies, accom¬ 
panied by the logarithms of their assigned probabilities__.. 18-16 

7-B The 18 digraphs composing 25% of the digraphs m Table 6-A, accompanied by the logarithms of 
their assigned probabilities, arranged alphabetically according to their initial letters (1) and ac¬ 
cording to their final letters (2) and according to their absolute frequencies_ 15 

7-G The 63 digraphs composing 60% of the 6,000 digraphs in Table 6-A, accompanied by the logarithms 
of their assigned probabilities, arranged alphabetically according to their initial letters (1) and ac- 

*' cording to their final letters (2) and according to their absolute frequencies...... 16 

7-D The 122 digraphs composing 75% of the 5,000 digraphs in Table 6-A, accompanied by the loga¬ 
rithms of their assigned probabilities, arranged alphabetically according to their initial letters (1) 

and according to their final letters (2) and according to their absolute frequencies-17-18 

7-E AH the 428 digraphs of Table 6-A, arranged first alphabetically according to their initial letters 

and then alphabetically according to their final letters------ 18 

8 The 428 different digraphs of Table 6-A, arranged first alphabetically according to their initial 
letters and then according to their absolute frequencies under each initial letter, accompanied by 

tho logarithms of their assigned probabilities-19-21 

9-A The 428 different digraphs of Table 6-A, arranged first alphabetically according to their final 
letters and then according to their absolute frequencies, accompanied by the logarithms of their 

assigned probabilities----22-24 

9-B The 1£ digraphs composing 25% of the 6,000 digraphs of Table 6-A, accompanied by the loga¬ 
rithm of their assigned probabilities, arranged alphabetically according to their final letters (1) and 
according to their initial letters (2) and according to their absolute frequencies-25 
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Table No t>#e« 

9-C The 63 digraphs composing 60% of the 6,000 digraphs of Table 6-A, accompanied by the loga¬ 
rithms of their assigned probabilities, arranged alphabetically according to their final letters (1) and 

according to their initial letters (2) and according to their absolute frequencies..-—-_26-26 

9-D The 122 digraphs composing 76% of the 6,000 digraphs of Table 6-A, accompanied by the loga¬ 
rithms of their assigned probabilities, arranged alphabetically according to their final letters (1) 

and according to their initial letters (2) and accordmg to their absolute frequencies.26-28 

9-E All the 428 different digraphs of Table 6-A, arranged alphabetically first accordmg to their final letters 

and then accordmg to their initial letters........ 28 

10-A The 66 trigraphs appearing 100 or more times in the 50,000 letters of Governmental plam-text tele¬ 
grams, arranged accordmg to their absolute frequencies, accompanied by the logarithms of their 

assigned probabilities--.- 28 

10-B The 66 trigraphs appearing 100 or more times m the 60,000 letters of Governmental plam-text tele¬ 
grams, arranged first alphabetically accordmg to their initial letters, and then accordmg to their 

absolute frequencies, accompanied by the logarithms of their assigned probabilities.. 29 

10—C The 66 trigraphs appearing 100 or more times m the 60,000 letters of Governmental plam-text tele¬ 
grams, arranged first alphabetically accordmg to their central letters, and then according to their 
absolute frequencies, accompanied by the logarithms of their assigned probabilities..29-80 

10- D The 66 trigraphs appearing 100 or more times in the 60,000 letters of Governmental plam-text tele¬ 

grams, arranged first alphabetically accordmg to their final letters, and then accordmg to their abso¬ 
lute frequencies, accompanied by the logarithms of their assigned probabilities___ 80 

11- A The 64 tetragraphs appearing 60 or more times in the 50,000 letters of Governmental plain-text 

telegrams, arranged accordmg to their absolute frequencies, accompanied by the logarithms of 

their assigned probabilities--...--- 81 

11-B The 54 tetragraphs appearing 60 or more times m the 60,000 letters of Governmental plain-text 
telegrams, arranged first alphabetically accordmg to their initial letters, and then accordmg to their 

absolute frequencies, accompanied by the logarithms of their assigned probabilities... 81 

11-C The 64 tetragraphs appearing 60 or more times in the 60,000 letters of Governmental plam-text 
telegrams, arranged first alphabetically according to their second letters, and then according to 

their absolute frequencies, acdbmpafiied by the logarithms of their assigned probabilities.. 82 

11-D The 64 tetragraphs appearing 60 or more times m the 60,000 letters of Governmental plain-text 
telegrams, arranged first alphabetically according to their third letters, and then according to 
their absolute frequencies, accompanied by the logarithms of their assigned probabilities.......... 82-88 

11-E The 64 tetragraphs appearing 60 or more times in the 60,000 letters of Governmental plain-text 
telegrams, arranged first alphabetically according to their final letters, and then accordmg to their 

absolute frequencies, accompanied by the logarithms of their assigned probabilities... 88 

12 Average length of words and messages..._. 84 

18 Checkerboard individual frequencies...*. 86 

14 Relative logarithmic values of frequencies of English digraphs..... 86 

16 Relative logarithmic values (Logo 222) of frequencies of English digraphs.. 87 

* * * * * 

SPECIAL-PURPOSE DATA 

l6-A. Frequency distribution of digraphs, based on 6h t 365 letters of 
decrypted U. S. Government messages in which Z was used as a 

word-separator and X was used for both Xp and Zp.«... 38 

l 6 -B. Frequency distribution of digraphs, based on the texb used for 
Table 16 -A, from which the Z word-separator has been omitted 

(total. 53,866 letters). 39 

l6-C • The 53 digraphs from Table 6-A which comprise 50$ of the to¬ 
tal, arranged according to frequencies reduced to a base of 
5,000 digraphs, shown with the corresponding frequencies of 
the same digraphs from Table l6-B (also reduced to a base 
of 5,000).... hO 
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Tablb 1-A —Absolute frequencies of letters appearing mfive sets of Governmental plain-text telegrams, 
each set containing 10,000 letters, arranged alphabetically 


Set No 1 

Set No 2 

Set No 8 

Set No 4 

set No. 5 

Lofcter 

Absolute 

Frequency 

.Letter 

Absolute 

Frequency 

Letter 

Absolute 

Frequency 

Letter 

Absolute 

Frequency 

Letter 

Absolute 

Frequency 

A __ 

738 

EBP 

783 

A__ 

681 

A._* 

740 


741 

B_ 

mm 

IjHM 

103 

E 

98 

B_ 

83 

- 

B 

99 

C_ 

319 


800 


288 

c 

826 

n 

301 

D _ . 

887 

iSBBi 

413 

MHHi 

423 

D 

461 

n 

448 

E. _ 

1,367 

E. 

1,294 

E_ 

1,292 

E_ 

1,270 

E_ .. 

1,276 

F 

253 

7? 

287 

r— 

808 

F„,_ 

287 

F 

281 

G . 

166 

G 

175 


161 

G _ _ 

167 

G 

150 

H . _ . 

810 

H_ 

361 

H 

335 

H_ __ 

349 

H 

349 

I_ 

742 

I_ _ _ 

760 

I _ 

787 

I 

700 

T 

697 

>T .. .... 

18 

J - 

17 

J . 

10 

n 

21 

J _ 

16 

K_ 

86 

K__ 

38 

K _ 

22 


21 

X . 

81 

T. 

365 

L. _ 

393 

L 

333 

hJEm 

886 

T. 

844 

M _ 

242 

M__ 

240 

M_ 

238 

M 

249 

M 

268 

N 

786 

N_ 

794 

N_ 

815 

N 

800 

N 

780 

0 

685 

0 _ 

770 

0__ 

791 

O 

766 

0_ 

762 

P 

241 

P ^ - 

272 

P 

817 

P 

245 

P _ 

260 

Q__ 

40 

Q_ 

22 

Q_ _ „ 

45 

0- - 

38 

Q__ 

80 

B 

760 

R_ 

745 

B 

762 

R 

735 

B 

786 

j 

S _ 

658 

S_ 

583 

s _ 

585 

s 

628 

S 

604 

T _ 

986 

T_ 

879 

T 

894 

T 

958 

T_ 

928 

n 

270 

U_ 

233 

U_ 

812 

II _ 

247 

II 

238 

V 

163 

V 

173 

V 

142 

V 

183 

V 

155 

w 

166 

W _ 

163 

W 

136 

V 

183 

W 

182 

X _ 

43 

x__ 

60 

X 

44 

X 

53 

X 

41 

V _ 

191 

y _ 

165 

Y _ 

179 

Y 

213 

Y_ 

229 

Z 

14 

7 . 

17 

Z 

2 

7 . 

11 

z_ 

5 











Total_ 

10,000 


10,000 


10,000 


10,000 


10,000 
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Table 1-B —Absolute frequencies of letters appearing in five sets of Governmental plain-text telegrams, 
each set containing 10,000 letters, arranged according to frequency 


Set No 1 

Set No 2 

Set No 8 

Set No 4 

Set No 6 

Letter 

Absolute 

Frequency 

Letter 

Absolute 

Frequency 

Letter 

Absolute 

Frequency 

Letter 

Absolute 

Frequency 

Letter 

Absolute 

Frequency 

EL_ 

1,367 

936 

F! 

1,294 

879 

m/m 

1,292 


1,270 

968 

E_ 

1,276 


T _ 


894 


T - 

928 

N_ 

786 

N_ 

794 


816 

N 

800 

a _ 

786 

R. _ 

760 


783 


791 

0 _ 

766 

N _ 

780 

I _ 

742 


770 

I 

787 

A _ 

740 

0 _ 

762 

A 

738 


760 

R_ 

762 

R_ 

785 

A 

741 

0 _ 

686 

R _ _ 

746 

A. - 

681 

nMBHHHi 

700 

1. _ 

697 

s__ 

668 


683 

S_ _ , 

585 


628 

s 

604 

D._ 

387 


413 

D_ 

423 


461 

D_ 

448 

L __ _ 

366 


393 

a 

335 


886 

a 

849 

fi 

319 

a _ 

361 

L. _ 

333 


349 

T. 

344 

H._ 

310 

c _ 

300 

P 

317 

c 

826 

c _ 

301 

IT 

270 

F 

287 

u 

312 

F 

287 

F__ 

281 

F_-- 

263 

p _ 

272 

F 

308 

a _ 

249 

M_ 

268 

Al 

242 

IL _ „ 

240 

C „ _ . 

288 

u_ 

247 


260 

P 

241 

U 

233 

a 

238 

p. 

245 

m 

238 

Y 

191 

G 

176 

Y_ 

179 

Y 

213 


229 

G_ 

166 

V 

173 

G 

161 

G 

167 

w_ 

182 

W_ 

166 

W 

163 


142 

V 

133 

V 

155 

V_ 

163 

Y 

165 

w 

136 

w 

133 

G_ 

160 

B_ 

104 

B_ _ 

103 

B 

98 

B 

88 

B 

99 


43 

X. _ _ 

50 

Q.- 

45 


63 

a_ 

41 

q 

40 

K 

38 

X. _ 

44 

Q - - 

38 

a 

31 

K- 

36 

Q- - — 

22 

K 

22 

a_ 

21 


30 

J_ 

18 


17 

J 

10 

j - 

21 


16 

Z._- 

14 


17 

Z_ 

2 

z. _ 

11 


5 

TotaL- 

10,000 


10,000 


10,000 

— 

10,000 

B 

10,000 


Table 1-C —Absolute frequencies of vowels, high-frequency consonants, medium-frequency con¬ 
sonants, and loro-frequency consonants appearing in five sets of Governmental plain-text tele¬ 
grams, each set containing 10,000 letters 
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Table 2-A —Absolute frequencies of letters appearing in the combined five sets of messages totalling 

50,000 letters, arranged alphabetically 


A ... 

3,683 

G 

819 

L 

1,821 

Q. . 

175 

V . . 

766 

B_ 

J87 

H 

1,694 

M 

. 1,237 

R _. 

3,788 

V i . . 

780 

C .. 

1,634 

I .. 

3,676 

N 

8,976 

S 

3,068 

X . . 

231 

D_ 

2,122 

J... 

82 

0 

. 3,764 

T . 

4,595 

Y. 

967 

E ... 
F . . 

6,498 

1,416 

K . 

148 

P 

1,336 

U. . 

1,300 

Z . . . 

49 


Table 2-B —Absolute frequencies of letters appearing m the combined five sets of messages totalling 

50,000 letters, arranged according to frequency 


E 

.. 6,498 

I. . .. 

3,676 

C 

. 1,534 

Y 

967 

X. . 

231 

T . 

.... 4,695 

S . .. 

3,058 

F 

1,416 

G 

819 

Q. .. . 

175 

N 

.. 3,975 

D ... . 

2,122 

P 

1,335 

W. .. 

780 

K .... 

148 

R. 

... 3,788 

L... . 

1,821 

U 

.. 1,300 

V 

766 

J ... 

82 

O... 

.... 3,764 

H.. 

1,694 

M 

. 1,237 

B 

487 

Z. ... 

49 


A_3,683 


Table 2-C —Absolute frequencies of vowels, high-frequency consonants, medium-frequency con- 
sonants, and low-frequency consonants appearing m the combined five sets of messages totalling 


50,000 letters 

Vowels...-.... 19,888 

High-frequency consonants (D, N, R, S, and T).17,638 

Medium-frequency consonants (B, C, F, G, H, L, M, P, V, and W).11,889 

Low-frequency consonants (J, K, Q, X, and Z)- 685 


Total.-..-.—... 60,000 


Table 2-D —Absolute frequencies of letters as initial letters of 10,000 words found m Governmental 

plain-text telegrams 

(1) ARRANGED ALPHABETICALLY 


A . 

905 

G 

109 

L 

196 

Q - 

30 

V 

77 

B. . 

287 

H 

272 

M . _ 

384 

R 

611 

W .. 

320 

C 

664 

I 

. _ 344 

N . . 

441 

S ... 

965 

X . 

4 

D 

525 

J 

44 

O. 

646 

T ... 

1,253 

Y .... 

88 

E . 

390 

K 

23 

P 

433 

U. .. 

122 

Z_ . 

12 

F _ . 

855 

















Total 

_10,000 




(2) ARRANGED ACCORDING TO 

FREQUENCY 



T 

1,253 

R 

611 

M 

384 

L 

196 

J. . 

44 

S . - 

965 

D. 

525 

I 

344 

U 

122 

Q .. . 

30 

A 

905 

N 

. .. 441 

W 

320 

G 

109 

K 

23 

F. .. . 

855 

P 

. . 433 

B 

287 

Y - 

88 

Z 

12 

C _. 

664 

E 

390 

H 

272 

V 

77 

X 

4 

0 

646 

















Total 

10,000 
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Table 2-E — Absolute frequencies of letters as final letters of 10,000 words found m Governmental 

plain-text telegi a,ns 


(1) ARRANGED ALPIIABETICAELY 


A 

269 

G. 

225 

L_ 

354 

Q 

. , 8 

V 


4 

B 

22 

H 

. 450 

M 

16l 

R 

*709 

W 


45 

C 

86 

I 

22 

N 

872 

S 

' '962 

X 


116 

D „ 

_ 1,002 

J. 

_ , 6 

0 . , 

575 

T- 

™ 1,007 

Y 

_ _ 

866 

E 

. 1,628 

K 

53 

P 

213 

U 

31 

Z 

_ 

9 

F 

. 252 








— 










Total 

..10 : 

,000 




(2) ARRANGED ACCORDING TO 

FREQUENCY 




E 

1,628 

R 

769 

F 

252 

c 

86 

I 


22 

T . 

1,007 

0 

. 575 

G 

225 

K 

53 

Z 


9 

D . 

.. 1,002 

H 

450 

P 

213 

W 

45 

Q 


8 

S 

962 

L 

354 

M 

154 

u 

31 

J . 


6 

N .. 

. . 872 

A 

269 

X 

116 

B. 

22 

V . _ 


4 

Y . 

_ 866 


















Total . 

10 

,000 


Table 3 —Relative frequencies of letters appearing in 1,000 letters based upon Table 2-B 


(1) ARRANGED ALPHABETICALLY 


A . 

73 

66 

G . 

. .. 16 

38 

L 

36 

42 

Q 

3 

50 

V 

.. 15 32 

B... .. 

9 

74 

H 

33 

88 

M 

24 

74 

R _ 

75 

76 

W 

. _ 15 60 

C _ . . 

30 

68 

I 

73 

52 

N 

79 

50 

s . . 

61 

16 

X 

. . 4 62 

D 

_42 

44 

J 

l 

64 

0 . 

75 

28 

T 

91 

90 

Y 

19 34 

E . . 

129 

96 

K 

2 

96 

P 

26 

70 

U 

26 

00 

Z 

&8 

F... . 

28 

32 











1 

j 







9 





Total 

1,000 ()0 





(2) ARRANGED ACCORDING 

TO 

FREQUENCY 



1 

E. 

129 

96 

I 

73 

52 

C 

'30 

68 

Y 

19 

34 

X 

. . 4 62 

T 

91 

90 

S . 

61 

16 

F 

28 

32 

G 

16 

38 

Q 

3 50 

N. 

79 

50 

D 

42 

44 

P 

26 

70 

W . 

15 

60 

K 

. 2 96 

R . 

75 

76 

L 

. . 36 

42 

U 

26 

00 

V 

15 

32 

J 

1 64 

0. . . 

75 

28 

H 

. 33 

88 

M 

24 

74 

B 

9 

74 

Z 

98 

A ... 

73 

66 























Total 

1,000 00 




(3) VOWELS 




(4) HIGH-FREQUENCY 




A. 




_ 73 

66 



CONSONANTS 





E 




129 

96 

D. 


_ 


42 

44 



I 




73 

52 

N 


„ __ 

r - 

79 

50 



0 




75' 28 

R 


- 

- 

75 

76 



U 




26 

00 

S 

- 

— 


61 

16 



Y 


— 


19 

34 

T 



— 

91 

90 





Total 

3.97 76 



Total 

350 

76 
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Table 8, Contd —Relative frequencies of letters appearing ifl 1,000 letters based upon Table 2-B 


(6) MEDIUM-FREQUENCY (6) LOW-FREQUENCY 

CONSONANTS CONSONANTS 


B .... 

9 74 

X 

. 4.62 

C . . 

80 68 

Q- 

. 3 50 

F.. ... . 

28 82 

K 

2 96 

G. 

16 38 

J. 

_ 1 64 

H . .. . ... 

33 88 

Z 

98 

L . 

M. 

36 42 

24 74 


Total . 18 70 

P . 

V . 

W ... 

Total 

26 70 

16 32 

16 60 

237 78 

- 

Total (3), (4), 

(6), (6).. . 1,000 00 


Table 4 —Frequency distribution for 10,000 letters of literary English, as compiled by Hitt 1 

(1) ARRANGED ALPHABETICALLY 


A .... 

778 

G 

.. .. 174 

L .. 

372 

Q. .. . 

8 

V .. . 

112 

B. 

141 

H 

. . .. 595 

M 

288 

R 

651 

W . .. 

176 

C . .. 

296 

I 

. . 667 

N . . 

686 

S . .. 

622 

X 

27 

D . 

402 

J 

. _ 51 

0 .. . 

807 

T. 

855 

Y_ 

196 

E 

F . .. 

1,277 

197 

K 

... 74 P. 223 

(2) ARRANGED ACCORDING TO 

U . . 308 

i FREQUENCY 

Z ... . 

17 

E . . 

1,277 

R 

. 651 

U_ 

308 

Y_. . . 

196 

K. ... 

74 

T. . .. 

855 

S 

_ 622 

C.. _. 

296 

W ... 

176 

J. 

51 

0_ 

807 

H 

_ 595 

M . _ 

288 

G . 

174 

X . .. 

27 

A .... 

778 

D 

... 402 

P... ._ 

223 

B . 

141 

Z .... 

17 

N... . 
I. 

686 

667 

L. 

. 372 

F... . 

197 

V .. 

112 

Q- . 

8 


Table 5 —Frequency distribution for 10,000 letters of telegraphic English, as compiled by Hitt 1 

(1) ARRANGED ALPHABETICALLY 


A .... 

.. 813 

G. 

. 201 

L. 

392 

Q ..... 

38 

V. . . 

.. 136 

B-. . 

. 149 

H 

_ 386 

M_ 

273 

R_ .._ 

677 

ff . ... 

.. 166 

C. .. 

. 306 

I_ 

_ 711 

N. .. 

718 

S.. .. 

65 6 

X ... 

51 

D.... 

. 417 

J. 

_ 42 

O_ 

844 

T_ 

634 

Y .. 

. 208 

E_ 

. 1,319 

K 

_ 88 

P. 

243 

U _ 

321 

Z . .. 

6 

F. 

_ 205 












(2) ARRANGED ACCORDING TO 

i FREQUENCY 



E .. 

. 1,319 

S. 

_ 656 

U . 

321 

F_ 

205 

K_ 

88 

0 . 

844 

T. 

634 

c_ 

306 

G _ 

201 

X. 

51 

A_. 

813 

D. 

. 417 

M__ 

273 

W_ 

166 

J .. . 

42 

N . 

.. 718 

L_ 

. ... 392 

P_ 

243 

B_ 

149 

Q. .. 

38 

I . . 

_ 711 

H. 

.. .. 386 

Y.. .. 

208 

V... . 

136 

Z ... 

6 

R. 

677 










1 Hitt, Capt Parker Manual for the Solution of Military Ciphers Army Service Schools Press, Fort 
Leavenworth, Kansas, 1916 
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Table 6-A —Frequency distribution of digraphs, based on 50,000 letters of Governmental plain-teat 

telegrams, reduced to 5,000 digraphs 

Second Letter 

Total 



A 

B 

C 

D 

E 

F 

G 

H 

I 

J 

K 

L 

M 

N 

0 

P 

Q 

R 

S 

T 

u 

V 

w 

X 

Y 

Z 

1 

Blanks 

A 

3 

6 

14 

27 

1 

4 

6 

2 

17 

1 

2 

82 

14 

64 

2 

12 


44 

41 

47 

IS 

7 

3 


12 


374 

8 I 

8 

4 




18 




2 

i 


6 

i 


4 



2 

1 

1 

2 




7 


49 

EE 

C 

1 


8 

i 

82 

i 


14 

7 


4 

5 

1 

1 

41 



4 

...... 

14 

4 


1 


"i 


166 

EE 

D 

82 

4 

4 

8 

33 

8 

2 

2 

27 

1 


8 

6 

4 

16 

6 

2 

12 

18 

"l6 

6 

1 

1 


~L 


209 

HE 

E 

85 

4 

32 

1 

42 

18 

4 

7 

27 

1 


29 

14 

111 

12 

E 

12 

87 

64 

87 

3 

1 

7 

7 

4 

1 

648 

EB 

F 

5 


2 

1 

B 

ii 

i 


89 



2 

1 


E 

1 


9 

3 

11 

3 


"1 


"l 


141 


G 

7 


2 

1 

14 

2 

1 

E 

5 

i 


... ^ 

1 

3 

6 

2 


5 

3 

4 

2 


~1 




82 

EE 

H 

m 

1 

3 

2 


6 



33 



1 

2 

3 

E 

1 

1 

17 

4 

28 

8 


_ 

i 


"I 



MB 

i 

8 

2 

22 

6 

18 

E 

19 




2 

23 

9 

75 

41 

7 


27 

86 

~27 


26 


16 


~2 

368 

EE 

j 

i 




2 










2 






2 






7 


K 

1 


1 


6 




2 



i 


i 





1 








13 

EE 

i L 

28 

3 

3 

9 

37 

3 

1 

1 

E 



27 

2 

1 

13 

3 


2 

6 

8 

2 

~2 



i 


188 


U M 

36 

6 

3 

1 

26 

1 


1 

9 




13 


E 

8 


2 

4 

2 

2 




n 


126 

HE 

I N 

26 

2 

19 

52 

57 

9 

27 

4 

E 

1 

2 

5 

5 

8 

18 

3 

1 

4 

24 

82 

7 

8 

8 


1 


397 

Hi 

ft 0 

7 

4 

8 

12 

3 

25 

2 

3 

5 

H 

n 

19 

25 

77 

6 

25 


64 

14 

19 

37 

7 

8 

1 

2 


376 

EE 

p 

14 

1 

1 

1 

28 

2 


8 

6 



13 

4 

1 

17 

11 


18 

6 

8 

3 

1 

1 




185 

ee 

Q 













1 





1 



15 






17 

2a 

R 

89 

2 

9 

17 

98 

6 

7 

3 

E 

1 

1 

6 

9 

7 

28 

13 


11 

81 

42 

6 

5 

~4 


~9 


882 

EE 

S 

24 

~3 

13 

5 

49 

12 

2 

26 

84 


~1 

2 

3 

4 

15 

E 


5 

19 

68 

11 

1 

~4 


1 


307 

By 

T 

28 

~3 

6 

6 

71 

7 

1 

78 

45 



5 

6 

7 

60 

- . 

2 

1 

17 

19 

19 

5 


36 

1 

41 

I 

454 

mg 

U 

5 

3 

3 

8 

11 

1 

~8 


6 



6 

5 

21 

1 

2 


31 

12 

12 


1 


- 1 



130 

EE 

V 

■ 




67 




12 






1 





1 







77 

21 

W 

i 




22 



4 

18 



1 


2 

19 



i 

i 






"l 


76 

m 

X 

2 


E 

1 

1 

1 



2 





1 

1 

2 


1 

1 

7 







23] 

13 

y 

6 

2 

E 

c 

9 

11 

"7 


3 



2 

2 

6 

E 

3 


4 

u 

15 

1 

i 

1 



_i 

96 

EE 

Z 

1 



■ 

2 




1 



i 

i 










■ 





■ 

g 

Total. 

m 

46 

154 

217 

657 

187 

82 

170 

374 

8 

14 

189 

123 

397 

378 

B 

17 

368 

304 

462 

E 

76 

77 

23 

99 

4 

5 f 000 

I | 

Blanks- 

1 

11 

6 

7 

1 

7 

12 

m 

3 

18 

19 

6 

6 

7 

3 

8 

21 

4 

4 

5 

M 

15 

n 

23 

10 

23 

■ 

EE 
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Tables 7-11, Inclusive 

Absolute frequencies of digraphs, tngraphs, and tetragraphs and the logarithms of their 

assigned probabilities 1 

1 For each of the following 18 tables, the basic data were first arranged according to their 
absolute frequencies (F), and then the logarithms—L M (F) of the frequencies found 

2 The tables are designed to facilitate determination of the relative weights or probability 
of occurrence of *ets of digraphs, tngraphs, or tetragraphs, particularly with respect to various 
“matching” operations For pxample, are the matched digraphs RE and ET more probable than 
the matched digraphs RT and EF ? Table 7-A shows the frequencies (F) of the digraphs to be 
as follows RE=98, ET=37, RT=42, EF=18 Therefore, 98 times 37 is compared with 42 
times 18, or 3,626 with 766 This arithmetic method of approach is extremely cumbersome for 
a large number of comparisons By using the logarithms of the individual frequencies, the opera¬ 
tion is greatly simplified, since the addition of the logarithms of two numbers is equivalent to 
the multiplication of their equivalent arithmetic values Thus, the foregoing computation may 
be expressed as Log 98+Log 37, compared with Log 42+Log 18, or 0 96+0 79 versus 0 81+0 66 
(see Table 7-A and explanation below) If more than one occurrence of a particular digraph is 
involved, it is merely necessary to multiply the logarithmic value by the number of the occur¬ 
rences, viz , Log X4 2(Log Y)+3(Log Z), as compared with Log A+3(Log B)+2(Log C) 

3 The logarithm of any given number is the power to which 10 must be raised to equal the 
given number Thus, 10 2 = 100, or the logarithm of 100=2 Similarly, 10 s =1,000, or the loga¬ 
rithm of 1,000 =3 The sum of logai ithms is equal to the logarithm of the product of their antilogs 
(arithmetic numbers they represent) For example, 10 2 =100, 10®=1000,10 2+ ®=100X1000, Log 
100,000 = 5 Also, 10°=1, or Log 1=0 The Log of 0 is minus infinity (-■») 

4 In the compilation of the logarithms of the elements constituting these tables, frequencies 
of 1, of course, had a logarithmic value of 0 00 Digraphs which did not occur , 2 1 e., those with 
0 occurrences, had a logarithmic value of minus infinity (-<») For practical use, each of 
the original frequency occurrences m these tables was doubled, i e, EN was given a frequency 
of 222 instead of 111, the frequency of RE became 196 instead of 98, etc Thus, single occur¬ 
rences were doubled (2x1=2), and the logarithms of those elements became 030 instead of 0 
This is equivalent to saying Log 1+Log 2=0 00+0 30=0 30 Those elements which occurred 
0 times, now were assumed to have an occurrence of 1, with an equivalent logarithmic value of 0 00 

5 In order to place all the logarithms of the initial frequencies on a comparable logarithmic 
basis, it was merely necessary to add 0 30 to each of them While EN had a frequency of 111 
in the original compilation, it now had a frequency of 222, or 2(111) The logarithm of 222 is 
2 36 This is equivalent to saying Log 111+Log 2=2 05+0 30=2 35 

6 The frequencies as stated in terms of their actual logarithms do not readily indicate their 
relative size for each distribution Therefore, the highest frequency m each group was given a 
value of 0 99, and the lowest a value of 0, frequencies intermediate between these extremes were 

i These frequency distributions are based upon data derived from 60,000 letters of U S Governmental plain-text telegrams, reduced to 6 000 
digraphs 

* 'While in general it is possible to assign probability values to digraphs in accordance with their observed frequencies, it is not strictly correct 
to associate the probability "0 with a frequency of zero This would be equivalent to saying Because a specified digraph has not occurred, it 
cannot occrn and would be reflected m the mathematics Log probability zero equals minus infinity ” What may be said is 'Since a specified 
digraph has not occurred in the data its true probability value is unknown except that it must be below the probability value assigned to a fre¬ 
quency of one ” The proper way to assign a probability value to digraphs with frequencieo of zero is to continue counting until they have at 
least one occurrence then the true relative probability can be found 

A simple practical method of taking thiB difficulty into account is merely to assume that in twice the amount of data the digraph probably 
would have occurred at least once that is it has a frequency of one-half 

It should be pointed out however, that since probabilities are multiplied (by summing logarithms) a 10% error in evaluating the digraph ZZ 
for example, makes the product, wherever ZZ occurs 10% wrong and is just as serious as a 10% error in evaluating the high frequency digraph EN 

In practice however results obtained from th* 1 logarithmic method are so satisfactory that refinements are not needed 
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evaluated in proportion tp their respective frequencies This is equivalent to expressing the 
frequencies in logarithms with a base other than 10 In other words, this procedure of converting 
the logarithms to the range from 00 to 99 consists m dividing up the original range of logarithms 
i into 100 equal parts and assigning each one to the proper rank m the range 

7 The new base (O) used to convert each of the digraphic frequencies to the logarithmic 
range 0 to 0 99 is derived as follows, when 222 is the highest frequency (F) 

Let222=C° 99 
Logjo 222=Logio C° ” 

Logio 222 = (0 99) (Logio C) 

C=224 


8 The formula for the computation of the logarithm to the new base (C) of any actual 
frequency (Y) of a series is 


T v LogtnY 
LogoY LogioG 

It is more expeditious to use reciprocals m the conversion of a whole series of logarithmic values, 
as m this instance The formula is (Logio C) -1 (Logio Y) = Log 0 Y 

9 The digraphic index chart, Table 15, on page 37, summarizes the logarithmic frequencies 
of all English plain-text digraphs, computed to a base of 224 so that the logarithm of the highest 
frequency (EN) is 0 99 

Example EN=222 


Logio 222=2 35 

(Log w C)" l =(Logio 224)“i=0 421 
Log, 222 = 0 421 X2 35 = 0 99 

10 • Likewise, the trigraphs and tetragraphs have been computed to the 
bases L 58 6 a® 13 - I<244 t respectively, so that the logarithms of the highest- 
frequency trigraph (EOT) and tetragraph (TIOE) are 0.99* Since no use is 
being made of the trigraphs appearing less than 100 times and tetragraphs 
appearing less than 50 times, the basic frequencies of the trigraphs and 
tetragraphs have not been doubled in computing the new bases of the loga¬ 


rithms. 
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Table 7-A — The i28 different digraphs of Table 6-A, arranged according to their absolute frequencies, 
accompanied by the logarithms of their assigned probabilities 


F 

L»(r> 

na 

mmm 

F 

Lio(F) 


F 

Lu(F) 

KM 

KZsB 

F 

Lio(F) | 

Liu 

<2F) 

en.. : 

Ill 

2 

05 

99 

DA.. 

32 


51 


OL . 

19 

1 

28 

67 


e 

a 

08 

58 

RE .. 

98 

1 

99 

96 

EC.. 

32 


51 



19 

1 

28 

67 

0D™ 

12 

l 

08 

58 

ER.. 

87 

1 

94 

94 

RS . 

31 


49 

75 

SS 

19 

1 

28 

67 

SF. - 

12 

l 

08 

58 

NT... 

82 

1 

91 

93 

UR. _ 

31 


49 

75 

TS. . 

19 

1 

28 

67 

US™. 

12 

1 

08 

58 

TH. . 

78 

1 

89 

92 

HI. . 

30 

1 

48 

75 

TT . 

19 

1 

28 

67 

UT .. 

12 

1 

08 

58 

ON „ 

77 

1 

89 

92 

RI. 

30 

1 

48 

75 

wo _ 

19 

1 

28 

67 

VI.. 

12 

1 

08 

58 

IN_ 

75 

1 

88 

92 

EL. 

29 

T 

46 

74 


18 

1 

26 

66 

WA._ 

12 

l 

08 

58 

TE.. 

71 

1 

85 

91 

HT.. 

28 

1 

45 

74 


18 

l 

26 

66 

FF™. 

11 

l 

04 

56 

AN. _ 

64 

1 

81 

89 

LA.. 

28 

1 

45 

74 

NO . 

18 

l 

26 

66 

FT™. 

11 

l 

04 

56 

OR .. 

64 

1 

81 

89 

RO.. 

28 

1 

45 

74 

PR. _ 

18 

l 

26 

66 

PP.. 

11 

l 

04 

56 

ST.. 

63 

1 

80 

88 

TA . 

28 

1 

45 

74 

AI__ 

17 

1 

23 

64 

RR._ 

11 

1 

04 

56 

ED. _ 

60 

1 

78 

88 

*2.495 




HR.„ 

17 

1 

23 

64 

SU. 

11 

1 

04 

56 

NE._ 

57 

1 

76 

87 






P0_. 

17 

1 

23 

64 

UE_. 

11 

1 

04 

56 

VE... 

57 

1 

76 

87 

AD . 

27 

1 

43 

73 

RD__ 

17 

1 

23 

64 

YF . 

11 

l 

04 

56 

ES._ 

54 

1 

73 

86 

DI _ 

27 

1 

43 

73 

TR._ 

17 

1 

23 

64 

YS._ 

11 

l 

04 

56 

ND_ 

52 

1 

72 

86 

El . 

27 

1 

43 

73 

DO.. 

16 

1 

20 

63 

FE.. 

10 

l 

00 

55 

TO_ 

50 

1 

70 

84 

IR.. 

27 

1 

43 

73 

DT._ 

16 

1 

18 

62 

IF.. 

10 

l 

00 

55 

SE. . 

49 

1 

69 

84 

IT . 

27 

1 

43 

73 

IX . 

15 

1 

18 

62 

LY. . 

10 

l 

00 

55 

1 1,249 




LL_. 

27 

1 

43 

73 

QU . 

15 

1 

18 

62 

M0_. 

10 

1 

00 

55 






NG . 

27 

1 

43 

73 

SO _ 

15 

1 

18 

62 

SP... 

10 

l 

00 

55 

AT_ 

47 

1 

67 

83 

ME 

26 

1 

41 

72 

YT_ 

15 

l 

18 

62 

Y0™ 

10 

1 

00 

55 

TI_. 

45 

1 

65 

82 

NA. . 

26 

1 

41 

72 

AC.. 

14 

l 

15 

61 

FR. _ 

9 

0 

95 


AR . 

44 

1 

64 

82 

SH 

26 

1 

41 

72 

AM 

14 

1 

15 

61 

IM.. 

9 

0 

95 


EE_ 

42 

1 

62 

81 

IV.. 

25 

1 

40 

72 

CH . 

14 

1 

15 

61 

LD_. 

9 

0 

95 

53 

RT _ 

42 

1 

62 

81 

OF.. 

25 

1 

40 

72 

CT._ 

14 

l 

15 

61 

MI.. 

9 

0 

95 

53 

AS_ 

41 

1 

61 

80 

OM 

25 

1 

40 

72 

EM _ 

14 

l 

15 

61 

NE¬ 

9 

0 

95 


CO... 

41 

1 

61 

80 

OP 

25 

1 

40 

72 

GE . 

14 

l 

15 

61 

RO... 

9 

0 

95 

53 

10 . „ 

41 

1 

61 

80 

NS. . 

24 

1 

38 

71 

OS . 

14 

l 

16 

61 

RM._ 

9 

0 

95 


TY_ 

41 

1 

61 

80 

SA 

24 

1 

38 

71 

PA _ 

14 

l 

15 

61 

RY_. 

9 

0 

95 

53 

FO_ 

40 

1 

60 

80 

IL . 

23 

1 

36 

70 

AU . 

13 

l 

11 

59 

YE_. 

9 

0 

95 


FI_ 

39 

1 

59 

80 

PE. 

23 

1 

36 

70 

DS 

13 

l 

11 

59 

DD 

8 

0 

90 

51 

RA . 

39 

1 

59 

80 

IC _ 

22 

1 

34 

69 

IE 

13 

1 

11 

59 

DF._ 

8 

0 

90 

51 

ET„ 

37 

1 

67 

79 

WE.. 

22 

1 

34 

69 

LO . 

13 

1 

11 

59 

HU _ 

8 

0 

90 

51 

LE_ 

37 

1 

57 

79 

UN_ 

21 

1 

32 

68 

MM. 

13 

1 

11 

59 

IA_. 

8 

0 

90 

51 

OU.. 

37 

1 

57 

79 

CA. 

20 

1 

30 

67 

PL.. 

13 

1 

11 

69 

LT._ 

8 

0 

90 

51 

MA.. 

36 

1 

56 

78 

EP_ 

20 

1 

30 

67 

RP_. 

13 

1 

11 

59 

MP._ 

8 

0 

90 

51 

TW.. 

36 

1 

56 

78 

EV_ 

20 

1 

30 

67 

SC__ 

13 

1 

11 

59 

NN__. 

8 

0 

90 

51 

EA... 

35 

1 

54 

78 

GH_ 

20 

1 

30 

67 


13 

l 

11 

69 

OC._ 

8 

0 

90 

51 

IS... 

35 

1 

64 

78 

HA. _ 

20 

1 

30 

67 

*3.745 




OW„ 

8 

0 

90 

51 

SI.. 

34 

1 

53 

77 

HE 

20 

1 

30 

67 






PT. . 

8 

0 

90 

51 

DE .. 

33 

1 

52 

77 

HO. _ 

20 

1 

30 

67 

AP . 

12 

l 

08 

58 

UG.. 

8 

0 

90 

51 

HI _ 

33 

1 

52 

77 

LI 

20 

1 

30 

67 

AY . 

12 

l 

08 

58 


7 

0 

85 

48 

AL . 

32 

1 

51 

76 

IG . 

19 

1 

28 

67 

DR._ 

12 

l 

08 

58 

BY 

7 

0 

85 

48 

CE. 

32 

1 

51 

76 

NC . 

19 

1 

28 

67 

EO . 

12 

1 

ilKl 

58 

Cl.. 

7 

0 

85 

48 


i The 18 digraphs above thu line compose -o c c of t! c total * The 122 digraphs above this Imc compo s 75% of the total 

9 The 53 digraphs above* tins line compose 60° of the toti.1 

2-13 


■ RESTItlCTEfr 




























































REF ID:A56892 



EH. . 
Eff.. 
EX . 
GA.... 
IP_. 
NU .. 
OA . 
OV . 
RG... 
RN . 
TF 
TN. 
XT . 
AB„. 
AG... 
BL . 
GO. 
ID .. 
KE . 
LS . 
MB. . 
00 _.. 
PI. 
PS .. 
RF.. 

TC_ 

TD .. 
TM .. 
UL .. 
VA . 
YA 
YN 
CL.. 
DM . 
DP.. 
DU . 
FA . 
GI 
GR 
HF 
NL 
NM 
NY .. 
01 .. 
RL 


7 0 85 48 
7 0 85 48 
7 0 85 48 
7 0 85 48 
7 0 85 48 

7 0 85 48 

7 0 85 48 

7 0 85 48 

7 0 85 48 

7 0 85 48 

7 0 85 48 

6 0 78 45 

6 0 78 45 

6 0 78 45 

6 0 78 45 

6 0 78 45 

6 0 78 45 

6 0 78 46 

6 0 78 45 

6 0 78 45 
6 0 78 45 
6 0 78 45 
6 0 78 45 
6 0 78 45 

6 0 78 45 

6 0 78 45 
6 0 78 45 

6 0 78 45 

6 0 78 45 

6 0 78 45 

5 0 70 42 
5 0 70 42 
5 0 70 42 

5 0 70 42 

5 0 70 42 

5 0 70 42 

6 0 70 42 

5 0 70 42 

5 0 70 42 

5 0 70 42 

5 0 70 42 

5 0 70 42 

5 0 70 42 


5 0 70 42 

5 0 70 42 

5 0 70 42 

5 0 70 42 

5 0 70 42 

6 0 70 42 

5 0 70 42 

5 0 70 42 

5 0 70 42 

4 0 60 38 

4 0 60 38 

4 0 60 38 

4 0 60 38 

4 0 60 38 

4 0 60 38 

4 0 60 38 

4 0 60 38 
4 0 60 38 

4 0 60 38 
4 0 60 38 

4 0 60 38 

4 0 60 38 
4 0 60 38 

4 0 60 38 
4 0 60 38 

4 0 60 38 

4 0 60 38 

4 0 60 38 
4 0 60 38 

4 0 60 38 

4 0 60 38 

4 0 60 38 

4 0 60 38 
4 0 60 38 
4 0 60 33 
4 0 60 3? 
3 0 48 33 

3 0 48 33 

3 0 48 33 

3 0 48 33 

3 0 48 33 

3 0 48 33 

3 0 48 33 

3 0 48 33 

3 0 48 33 


is of Table 6-A, 
logarithms of their 

■m 

F 


GS-. 

3 

0 

48 

HC . 

3 

0 

48 

HN . 

3 

0 

48 

LB 

3 

0 

48 

LC 

3 

0 

48 

LF 

3 

0 

48 

LP . 

3 

0 

48 

MC 

3 

0 

48 

NP . 

3 

0 

48 

NV , 

3 

0 

48 

NW 

3 

0 

48 

OE .. 

3 

0 

48 

OH . 

3 

0 

48 

PH. 

3 

0 

48 

PU . 

3 

0 

48 

RH 

3 

0 

48 

SB . 

3 

0 

48 

SM 

3 

0 

48 

TB 

3 

0 

48 

UB. . 

3 

0 

48 

UC... 

3 

0 

48 

UD .. 

3 

0 

48 

YI . 

3 

0 

48 

YP._ 

3 

0 

48 

AH.. 

2 

0 

30 

AK.. 

2 

0 

30 

AO... 

2 

0 

30 

BI. _ 

2 

0 

30 

BR... 

2 

0 

30 

BU . 

2 

0 

30 

DG .. 

2 

0 

30 

DH. 

2 

0 

30 

DQ... 

2 

0 

30 

FC . 

2 

0 

30 

FL . 

2 

0 

30 

GC _ 

2 

0 

30 

GF.. 

2 

0 

30 

GL . 

2 

0 

30 

GP 

2 

0 

30 

GU 

2 

0 

30 

HD 

2 

0 

30 

HM 

2 

0 

30 

IB 

2 

0 

30 

IK 

2 

0 

30 

IZ 

2 

0 

30 


F I | (2D 


2 0 30 25 
2 0 30 25 
2 0 30 25 

2 0 30 25 

2 0 30 25 

2 0 30 25 

2 0 30 25 

2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 

2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 

2 0 30 25 
2 0 30 25 

2 0 30 25 
2 0 30 25 

2 0 30 25 

2 0 30 25 

2 0 30 25 

2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 

2 0 30 25 

2 0 30 25 

1 0 00 13 

0 00 13 

0 00 13 

0 00 13 

0 00 13 

0 00 13 

0 00 13 

0 00 13 

0 00 13 

0 00 18 
0 00 13 

0 00 13 
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Tablij 7-A, Concluded —The I#8 different digraphs of Table 6-A, arranged according to tlmr 
absolute frequencies, accompanied by the logarithms of their assigned probabilities 


F 

LwCT) 

L?24 

<zr) 

1? 

Lio(F) 

■pjsai 

F 

Lw(F) 


F 

IMF) 

Lk4 

(2F) 

CY. . 

1 

0 

00 

13 

HW 

1 

0 

00 

13 

PD . 

1 

0 

00 



■ 

0 

00 

13 

DJ _ 

1 

0 

00 

13 

HY... 

1 

0 

00 

13 

PN . 

1 

0 

00 

■lift 



0 

00 

13 

DY _ 

1 

0 

00 

13 

JA . 

1 

0 

00 

13 

PV 

1 

0 

00 

13 


n 

0 

00 

13 

EJ . 

1 

0 

00 

13 

KA... 

1 

0 

00 

13 

PW .. 

1 

0 

00 

13 


s 

0 

00 

13 

EZ... 

1 

0 

00 

13 

KC.. 

1 

0 

00 

13 

PY .. 

1 

0 

00 

13 


m 

0 

00 

13 

FD. _ 

1 

0 

00 

13 

KL._ 

1 

0 

00 

13 

QM._ 

1 

0 

00 

13 

XE. 

i 

0 

00 

13 

FG . 

1 

0 

00 

13 

KN._ 

1 

0 

00 

13 

QR - 

1 

0 

00 

13 

i 

i 

i 

En* 

X 

i 

0 

00 

13 

FM.. 

1 

0 

00 

13 

KS . 

1 

0 

00 

13 

RJ . 

1 

0 

00 

13 

XH-. 

i 

0 

00 

13 

FP.. 

1 

0 

00 

13 

LG .. 

1 

0 

00 

13 

RK 

1 

0 

00 

13 

XN . 

i 

0 

00 

13 

FW._ 

1 

0 

00 

13 

LH... 

1 

0 

00 

13 

SK . 

1 

0 

00 

13 

XO.... 

i 

0 

00 

13 

FY-. 

1 

0 

00 

13 

LN . 

1 

0 

00 

13 

SV . 

1 

0 

00 

13 

XR .. 

i 

0 

00 

13 

GD... 

1 

0 

00 

13 

MD._ 

1 

0 

00 

13 

SY .. 

1 

0 

00 

13 

XS _ 

i 

0 

00 

13 

GG... 

1 

0 

00 

13 

MF . 

1 

0 

00 

13 

TG._ 

1 

0 

00 

13 

YG... 

i 

0 

00 

13 

GJ._ 

1 

0 

00 

13 

MH .. 

1 

0 

00 

13 

TQ . 

1 

0 

00 

13 

YH__ 

i 

0 

00 

13 


1 

0 

00 

13 

NJ... 

1 

0 

00 

13 

TZ... 

1 

0 

00 

13 

YU._ 

i 

0 

00 

13 

GW_ 

1 

0 

00 

13 

NQ.. 

1 

0 

00 

13 

UF. 

1 

0 

00 

13 

YW_. 

i 

0 

00 

13 

HB.. 

1 

0 

00 

13 

OJ.. 

1 

0 

00 

13 

UO . 

1 

0 

00 

13 

ZA . 

i 

0 

00 

13 

HL_. 

1 

0 

00 

13 

OX 

1 

0 

00 

13 

UV . 

1 

0 

00 

13 

ZI .. 

i 

0 

00 

.13 

HP.. 

1 

0 

00 

13 

PB.. 

1 

0 

00 

I 13 

VO . 

1 

0 

00 

13 

5.000 




HQ .. 

1 

0 

00 

L_l§_ 

[ PC . 

1 

0 

00 

1 13 

VT . 

1 

0 

00 

13 


_j 





Table 7-B —The 18 digraphs composing 25% of the digraphs m Table 6-A, accompanied by the 
logarithms of their assigned probabilities, arranged alphabetically according to their initial letters 


(1) AND ACCORDING TO THEIR FINAL (2) AND ACCORDING TO THEIR ABSOLUTE 

LETTERS FREQUENCIES 


F 

Lw(F) 

mm 

_' 

Lw(T) 

■HI 

r 

L»(F> 

(2F) 

F 

LiofF) 

Lxn 

(2F) 

AN. 

. 64 

1 

81 

89 

ON. 

.. 77 

b 

89 

92 

AN 

. 64 

l 

81 

89 

ON.. 

77 

l 

89 

92 






OR. 

. 64 

Q 

81 

89 






OR .. 

64 

1 

81 

.89 

ED. 

.. 60 

1 

78 

88 

RE. 

00 

a 

i 

1 

99 

96 

EN. 

..111 

2 

05 

99 

RE 

98 

1 

99 

96 

EN 

..111 

2 

05 

99 






ER. 

. 87 

1 

94 

94 






ER 

.. 87 

1 

94 

94 

SE 

. 49 

l 

69 

84 

ED 

_ 60 

1 

78 

88 

ST . 

63 

1 

80 

88 

ES 

_ 64 

1 

73 

86 

ST. 

_ 63 

l 

80 

88 

ES. 

. 54 

1 

73 

86 

SE _ 

49 

1 

69 

84 






TE. 

. 71 

l 

85 

91 






TH 

78 

l 

89 

92 

IN 

... 75 

1 

88 

92 

TH. 

. 78 

1 

89 

92 

IN 

.. 75 

1 

88 

92 

TE.. 

71 

l 

85 

91 






TO.. 

50 

1 

70 

I 

84 


j 




TO.... 

50 

1 

70 

84 

ND 

. 52 

1 

72 

85 

VE. 

57 

1 

76 

87 

NT.. 

.. 82' 

1 

91 

i 

02 

VE.. 

57 

1 

76 

1 

87 

NE 

57 

1 

76 

87 

1 

,249 




NE 

- 57 

1 

76 

87 

1, 

249 




NT 

... 82 

1 

91 

93 

f 

9 \ 




ND. 

52 

1 

72 

85 
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Table 7-C —The 53 digraphs composing 50% of the 5,000 digraphs of Table 6-A, accompanied by 
the logarithms of their assigned probabilities, arranged alphabetically according to their initial letters 


(1) AND ACCORDING TO THEIR FINAL (2) AND ACCORDING TO THEIR ABSOLUTE 

LETTERS FREQUENCIES 


F 

Lia(F) 

mmm 

HI 

F 

Lio<r) 

■MM 

Bfl 

r 

Lu(D 

MM 

r 

L»(T) 

bm 

(2F) 

AL.. 

32 

1 

61 

76 

MA . 

36 

i 

56 

78 

AN. 


64 

0 

81 

89 

MA __ 

36 

l 

56 

78 

AN... 

64 

1 

81 

89 






AT 

m 

47 

0 

67 

83 






AR.. 

44 

1 

64 

82 

ND. 

52 

i 

72 

85 

AR 

m 

44 

l 

64 

82 

NT_ 

82 

l 

91 

93 

AS. 

41 

1. 

61 

80 

NE . 

57 

i 

76 

87 

AS 


41 

l 

61 

80 

NE.. 

67 

l 

76 

87 

AT... 

47 

1 

67 

83 

NI . 

30 

i 

48 

75 

AL 


32 

l 

61 

76 

ND_ 

62 

l 

72 

85 






NT. 

82 

i 

91 

93 







NI_ 

30 

l 

48 

75 

CE... 

32 

1 

51 

76 






CO 


41 

l 

61 

80 






CO__ 

41 

1 

61 

80 

ON.. 

77 

i 

89 

92 

CE 

m 

32 

l 

51 

76 

ON _ 

77 

l 

89 

92 






OR . 

64 

i 

81 

89 







OR . 

64 

1 

81 

89 

DA... 

32 

1 

51 

76 

OU . 

37 

i 

57 

79 

DE. 

m 

33 

l 

52 

77 

OU__ 

37 

l 

57 

79 

DE._ 

33 

1 

52 

77 

RA„ 

39 

i 

59 

80 

DA. 


32 

l 

51 

76 

RE_ 

98 

l 

99 

96 

EA .. 

36 

1 

54 

78 

RE . 

98 

i 

99 

96 

EN 

111 

2 

05 

99 

RT_. 

42 

l 

62 

81 

EC . 

32 

1 

51 

76 

RI._ 

30 

i 

48 

75 

ER 


87 

1 

94 

94 

RA... 

39 

l 

59 

.80 

ED__ 

60 

1 

78 

88 

RO 

28 

i 

45 

74 

ED_ 

_ 

60 

1 

78 

88 

RS . 

31 

l 

49 

76 

EE_ 

42 

1 

62 

81 

RS 

31 

i 

49 

76 

ES 


54 

1 

73 

86 

RI_. 

30 

1 

48 

75 

EL.. 

29 

1 

46 

74 

RT_. 

42 

i 

62 

81 

EE 


42 

1 

62 

81 

RO.. 

28 

l 

45 

74 

EN ._111 

2 

05 

99 






ET. 


37 

1 

57 

79 






ER.. 

87 

1 

94 

94 

SE-. 

49 

i 

69 

84 

EA 


35 

1 

54 

78 

ST_ 

63 

l 

80 

88 

ES _ 

54 

1 

73 

86 

SI._ 

34 

i 

53 

77 

EC. 

_ 

32 

1 

51 

76 

SE . 

49 

l 

69 

84 

ET_ 

37 

1 

57 

79 

ST . 

63 

i 

80 

88 

EL. 

- 

29 

1 

46 

74 

SI._ 

34 

l 

53 

77 

FI. _ 

39 

1 

59 

80 

TA . 

28 

i 

45 

74 

FO_ 


40 

1 

60 

80 

TH — 

78 

l 

89 

92 

FO_ 

40 

1 

60 

.80 

TE._ 

71 

i 

85 

91 

FI_ 


39 

1 

59 

80 

TE_. 

71 

l 

85 

91 






TH. . 

78 

i 

89 

92 







TO_ 

50 

l 

70 

84 

HI_. 

33 

1 

52 

77 

TI_. 

45 

i 

65 

82 

HI 

. 

33 

1 

52 

77 

TI.. 

45 

l 

65 

82 

HT_. 

28 

1 

45 

74 

TO 

50 

i 

70 

84 

HT. 

. 

28 

1 

46 

74 

i 

TY __ 

41 

l 

61 

80 






TW.. 

36 

i 

56 

78 






TW._ 

36 

l 

56 

78 

IN . 

76 

1 

88 

92 

TY . 

41 

i 

61 

80 

IN 


76 

1 

88 

92 

TA. 

28 

l 

45 

74 

10 . 

41 

1 

61 

80 






10. 

* 

41 

1 

61 

80 






IS . 

35 

1 

54 

78 

UR _ 

31 

i 

49 

76 

IS. 


35 

1 

54 

78 

UR.. 

31 

l 

49 

75 

LA._ 

28 

1 

45 

74 

ra 

i 

i 

57 

i 

76 

87 

LE. 


37 

1 

67 

79 

VE _ 

57 

l 

76 

87 

LE » . 

37 

JL_ 

57 

79 

2,495 




LA 


28 

1_ 

45 

74 

2,495 
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Tabie 7-D —The 122 digraphs composing 75% of the 5,000 digraphs of Table 6-A, accompanied 
by the logarithms of their assigned probabilities, arranged alphabetically according to their 
initial letters 


a) AND ACCORDING TO THEIR PINAL LETTERS 



RESTRICTE D- 


2-17 


























RESTRICTED 


REF ID: A56892 


Table 7-D, Concluded —The digraphs cothposmjj 7f8% of the 5 ffib 'digraphs of Table %-A, 

accompanied by the logarithms of their assigned probabilities, arranged alphabetically according 
to their initial letters 


(2) AND ACCORDING TO THEIR ABSOLUTE FREQUENCIES 



Table 7-E —All the 128 digraphs of Table 6-A, arranged first alphabetically according to their 
initial htteis and then alphabetically according to then final letters 

(SEE TABLE 6-A—READ ACROSS THE ROWS) 
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Table 8 —The 128 different digraphs of Table 6-A, arranged first alphabetically according to their 
initial letters and then according to their absolute frequencies under each initial letteraccompanied 
by the logarithms of their assigned probabilities 


F 

Lio(D 

Irt4 

(2D 

r 

Ii«(D 

■m 

r 

Lio(F) 

OF) 

F 

| Lw(r) | 

(2D 

AN . 

64 

l 

81! 

89 

CT . 

1 

14 

n 

15 

61 

ED 


60 

1 

78 

88 

GH__ 

20 

1 

m 

67 

AT.... 

47 

l 

67 J 

83 

Cl . 

7 


85 

48 

ES 

__ 

54 

1 

73 

86 

GE. 

14 

1 

15 

61 

AR... 

44 

l 

64 

82 

CL.. 

5 


70 

42 

EE. 

_ m 

42 

1 

62 

81 

GA . 

7 

□ 

85 

48 

AS.... 

41 

l 

61 

80 

CK . 

4 

0 

60 

38 

ET 

_ 

37 

1 

57 

79 

GO. . 

6 

0 

78 

45 

AL . 

32 

l 

61 

76 

CR.. 

4 

0 

60 

38 

EA 

— 

35 

1 

54 

78 

GI.. 

6 

0 

70 

42 

AD. . 

27 

l 

43 

73 

CU . 

4 

0 

60 

38 

EC 


32 

1 

51 

76 

GR .. 

5 

0 

70 

42 

AI_. 

17 

l 

23 

64 

CC .. 

3 

0 

48 

33 

EL 


29 

11 

46 

74 

GT_ 

4 

0 

60 

38 

AC.. 

14 

l 

15 

61 

CD... 

1 

0 

00 

13 

El 


27 

1 

43 

73 

GN.. 

3 

0 

48 

33 

AM... 

14 

l 

15 

61 

CF... 

1 

0 

00 

13 

EP 

— 

20 

1 

30 

67 

GS... 

3 

0 

48 

33 

AU . 

13 

l 

11 

59 

CM . 

1 

0 

00 

13 

EV 


20 

l 

30 

67 

GC... 

2 

0 

30 

25 

AP.... 

12 

l 

08 

58 

CN .. 

1 

0 

00 

13 

EF. 


18 

1 

26 

66 

GF.. 

2 

0 

30 

25 

AY . 

12 

l 

08 

58 

CS.. 

1 

0 

00 

13 

EM 


14 

1 

15 

61 

GL._ 

2 

o 

30 

25 

AV„ 

7 

0 

85 

48 

cw . 

1 

0 

00 

13 

EO. 


12 

1 

08 

58 

GP_. 

2 

0 

30 

25 

AB..„ 

6 

0 

78 

45 

CY.. 

1 

0 

00 

13 

EQ 


12 

l 

08 

58 

GU... 

2 

0 

30 

25 

AG .. 

6 

0 

78 

45 






EH 


7 

0 

85 

48 

GD 

1 

0 

00 

13 

AF... 

4 

0 

60 

38 

DE 

33 

1 

52 

77 

EW 


7 

0 

85 

48 

GG . 

1 

0 

00 

13 

AA _ 

3 

0 

48 

33 

DA . 

32 

1 

51 

76 

EX 


7 

0 

85 

48 

GJ... 

1 

0 

00 

13 

AW.... 

3 

0 

48 

33 

DI 

27 

1 

43 

73 

EB 


4 

0 

60 

38 

GM... 

1 

0 

00 

13 

AH _ 

2 

0 

30 

25 

DO. _. 

16 

1 

20 

63 

EG 


4 

0 

60 

38 

GW... 

1 

0 

00 

13 

AK.. 

2 

0 

30 

25 

DT . 

15 

1 

18 

62 

EY 


4 

0 

60 

38 






AO .. 

2 

0 

30 

25 

DS 

13 

1 

11 

59 

EU 


3 

0 

48 

33 




! 


AE .. 

1 

0 

00 

13 

DR . 

12 

1 

08 

58 

EJ 

... 

1 

0 

00 

13 






AJ_. 

1 

0 

00 

13 

DD.. 

8 

!o 

90 

1 51 

EZ 


1 

0 

00 

13 

HI. . 

33 

1 

52 

77 






DF. . 

8 

0 

90 

51 







HT. . 

28 

1 

45 

74 

BE_ 

18 

1 

26 

66 

DM . 

5 

io 

70 

42 

FO. 


40 

1 

60 

80 

HA_ 

20 

1 

30 

67 

BY... 

7 

0 

85 

48 

DP . 

5 

0 

70 

42 

FI 


39 

1 

59 

80 

HE... 

20 

1 

30 

67 

BL... 

6 

0 

78 

45 

DU.. 

5 

0 

70 

42 

FF 


11 

1 

04 

56 

HO... 

20 

1 

30 

67 

BA_ 

4 

0 

60 

38 

DB 

4 

0 

60 

38 

FT 


11 

I 1 

04 

56 

HR . 

17 

1 

23 

64 

BO... 

4 

0 

60 

38 

DC . 

4 

0 

60 

38 

FE 

__ 

10 

ll 

00 

55 

HU.. 

8 

0 

90 

51 

BI. . 

2 

0 

30 

25 

DN . 

4 

0 

60 

38 

FR 


9 

0 

95 

53 

HF .. 

5 

0 

70 

42 

BR... 

2 

0 

30 

25 

DW... 

4 

0 

60 

38 

FA. 

. ■ 

5 

0 

70 

42 

HS_. 

4 

0 

60 

38 

BU 

2 

0 

30 

25 

DL.. 

a 

0 

48 

33 

FS 


3 

0 

48 

33 

HC.. 

3 

0 

48 

33 

BJ. . 

1 

0 

00 

13 

DV.. 

3 

0 

48 

<53 

FU 


3 

1° 

48 

33 

HN . 

3 

0 

48- 

33 

BM .. 

1 

0 

00 

13 

DG . 

2 

0 

30 

25 

FC 

... 

2 

0 

30i 

25 i 

HD... 

2 

0 

30 

25 

BS .. 

1 

0 

00 

13 

DH 

2 

0 

30 

25 

FL 


2 

0 

30 

25 

HM .. 

2 

0 

30 

25 

BT. 

1 

0 

00 

13 

DQ... 

2 

0 

30 

26 

FD 

... 

1 

o 

00 1 

13 

HB.. 

1 

0 

00 

13 






DJ.. 

JL 

0 

00 

13 

FG 


1 

0 

oo! 

13 

HL . 

1 

0 

00 

13 

CO . 

41 

1 

61 

80 

DY.. 

1 

0 

00 

13 

FM 


1 

0 

00 

13 

HP.. 

1 

0 

00 

13 

CE 

32 

1 

51 

76 





i 

FP 


1 

0 

00 

13 

HQ . 

1 

0 

00 

13 

CA 

20 

1 

30 

67 

en. : 

111 

2 

05 

99 

FW 

m 

1 

0 

00 

13 

HW.. 

1 

0 

00 

13 

CH _ 

14 

i 

1 

15 

i 

61 

i 

ER. . 

87 

i 

1 

94 

1 

94 

- j 

FY 

- 

i 

0 

00 

13 

- 1 

HY. . 

1 

0 

00 

13 


1 For arrangement alphabetically first under initial letters and then under final letters, see Table 6-A 
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Table 8, Contd —The i%8 different. digraphs of Table "6-A, arranged first alphabetically according 
to their initial letters and then according to their absolute frequencies under each initial letter, 1 
accompanied by the logarithms of their assigned probabilities 



1 Tor airangement alphabetically first under initial letters and then under final letter*, see Table 6-A 
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Table 8, Concluded — The 1$8 different digraphs of Table 6-A, arranged first alphabetically accords 
mg to their initial letters and then according to their absolute frequencies under each initial letter, 1 
accompanied by the logarithms of theur assigned probabilities 


F 

Lio(r> 

BH 

■BQ 

Lio(F) 

mm 

F 

I*.(F) 


F 

Lm(F) 

~XkT 

(ZF) 

RE 

98 

B 


96 

SR.. 

5 

0 

70 

42 

us - 

12 

1 

08 

58 

XI._ 

2 

0 

30 

25 

RT _ 

42 

D 


81 

SN . 

4 

0 

60 

38 

UT. _ 

12 

1 

08 

58 

XP _ 

2 

0 

30 

25 

RA _ 

39 

i 

59 

80 

Sff _ 

4 

0 

60 

38 

UE-. 

11 

1 

04 

56 

XD_ 

1 

0 

00 

13 

RS 

31 

i 

49 

75 

SB. _ 

3 

0 

48 

33 

UG_. 

8 

0 

90 

51 

XE... 

1 

0 

00 

13 

RI . 

30 

i 

48 

75 

SM.. 

3 

0 

48 

33 

UL_ . 

6 

0 

78 

46 

XF.... 

1 

0 

00 

13 

RO.. 

28 

i 

45 

74 

SG _ 

2 

0 

30 

25 

UA.. 

5 

0 

70 

42 

XH... 

1 

0 

00 

13 

RD. _ 

17 

i 

23 

64 

SL _ 

2 

0 

30 

25 

UI._ 

5 

0 

70 

42 

XN_. 

1 

0 

00 

13 

RP.. 

13 

i 

11 

69 

SK.. 

1 

0 

00 

13 

UM . 

5 

0 

70 

42 

XO.. 

1 

0 

00 

13 

RR.. 

11 

i 

04 

56 

SV__ 

1 

0 

00 

13 

UB. _ 

3 

0 

48 

33 

XR. 

1 

0 

00 

13 

RC.. 

9 

0 

95 

53 

SY.. 

1 

0 

00 

13 

UC._ 

3 

0 

48 

33 

XS_. 

1 

0 

00 

13 

RM._ 

9 

0 

95 

53 






UD._ 

3 

0 

48 

33 






RY_ 

9 

0 

95 

53 

TH . 

78 

1 

89 

92 

UP__ 

2 

0 

30 

25 

YT . 

15 


m 

62 

RG _ 

7 

0 

85 

48 

TE.. 

71 

1 

85 

91 

UF_ 

1 

0 

00 

13 

YF .. 

11 

fej 


56 

RN . 

7 

0 

85 

48 

TO. . 

50 

1 

70 

84 

U0._ 

1 

0 

00 

13 

YS. _ 

11 

1 

04 

66 

RF . 

6 

0 

78 

45 

TI_. 

45 

1 

65 

82 

UV . 

1 

0 

00 

13 

YO... 

10 

1 

00 

55 

RL... 

5 

0 

70 

42 

TY . 

41 

1 

61 

80 






YE. _ 

9 

0 

95 

53 

RU 

5 

0 

70 

42 

TW. _ 

36 

1 

56 

78 

VE . 

67 

1 

76 

87 

YA.. 

6 

0 

78 

45 

RV . 

5 

0 

70 

42 

TA. 

28 

1 

45 

.74 

VI _ 

12 

1 

08 

58 

YN__ 

6 

0 

78 

45 

RW.. 

4 

0 

60 

38 

TS _ 

19 

1 

28 

67 

VA . 

6 

0 

78 

45 

YC. _ 

4 

0 

60 

38 

RH _ 

3 

0 

48 

33 

TT_ 

19 

1 

28 

67 

VO.. 

1 

0 

00 

13 

YD.. 

4 

0 

60 

38 

RB.._. 

2 

0 

30 

25 

TR. _ 

17 

1 

23 

64 

VT. . 

1 

0 

00 

13 

YR_ 

4 

0, 

60 

38 

RJ ... 

1 

0 

00 

13 

TF . 

7 

0 

85 

48 






YI_. 

3 

0 

48 

83 

RK.. 

1 

0 

00 

13 

TN_. 

7 

0 

85 

48 

WE _ 

22 

1 

34 

69 

YP _ 

3 

0 

48 

33 






TC . 

6 

0 

78 

45 

WO. 

19 

1 

28 

67 

YB._ 

2 

0 

30 

25 

ST... 

63 

1 

80 

88 

TD . 

6 

0 

78 

45 

WI _ 

13 

1 

11 

59 

YL _ 

2 

0 

30 

25 

SE_ 

49 

1 

69 

84 

TM . 

6 

0 

78 

45 

WA . 

12 

1 

08 

58 

YM.. 

2 

0 

30 

25 

SI_ 

34 

1 

53 

77 

TL . 

5 

0 

70 

42 

WH . 

4 

0 

60 

38 

YG_. 

1 

0 

00 

13 

SH. _ 

26 

1 

41 

72 

TU_ 

5 

0 

70 

42 

■p 

2 

0 

30 

25 

YH_ 

1 

0 

00 

13 

SA _ 

24 

1 

38 

71 

TB.. 

3 

0 

48 

33 

WL_. 

1 

0 

00 

13 

YU... 

1 

0 

00 

13 

SS.... 

19 

1 

28 

67 

TP.. 

2 

0 

30 

25 

WR 

1 

0 

00 

13 

4 

i 

i 

1 

0 

00 

13 

S0_.. 

15 

1 

18 

62 

TG . 

1 

0 

00 

13 

WS._ 

1 

0 

00 

13 






SC_ 

13 

1 

11 

59 

TQ_. 

1 

0 

00 

13 

WY_. 

1 

0 

00 

13 

ZE__ 

2 

0 

30 

25 

SF _ 

12 

1 

08 

58 

TZ _ 

1 

0 

00 

13 






ZA... 

1 

0 

00 

13 

SU _ 

11 

1 

04 

56 






XT. _ 

7 

0 

85 

48 

ZI.. 

1 

0 

00 

13 

SP_.._ 

10 

1 

00 

55 

UR_ 

31 

1 

49 

75 

XA. . 

2 

0 

30 

25 

5, 

D00 




SD . 

5 

0 

70 

42 

UN_ 

21 

1 

32 

68 

xc_. 

2 

0 

30 

25 


, 





1 For arrangement alphabetically first under initial letters and then under final letters, see Table 6-A. 
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Tabld 9-A —The 1$8 different digraphs of Table 6-A, arranged first alphdbetically according to 
their final letters and then according to their absolute frequencies, accompanied by the logarithms 
of their assigned probabilities 


F 

Lio(F) 

1*"* 

(ZF) 

r 

Lw(F) 

L vh 

(ZF) 

F 1 

Lio(F) 

m£Bm 

F 

Lio(F) 

JUm 

<3F) 

RA 


39 

1 

59 

80 

EC 


32 

B 


76 

RE.. 

98 

l 

99 

96 

GF__ 

2 

0 

30 

25 

MA 

.. 

36 

1 

56 

78 

IC 


22 

Q 


69 

TE.. 

71 

l 

85 

91 

PF 

2 

0 

30 

25 

EA. 

„ 

35 

1 

54 

78 

NC 


19 

0 


67 

NE... 

57 

1 

76 

87 

CF.. 

1 

0 

00 

13 

DA 

... 

32 

1 

51 

76 

AC 


14 

i 

15 

61 

VE.. 

57 

l 

76 

87 

MF... 

1 

0 

00 

13 

LA 


28 

1 

45 

74 

SC 

. 

13 

i 

11 

59 

SE 

49 

1 

fill 

84 

UF... 

1 

0 

00 

13 

TA 


28 

1 

45 

74 

RC 


9 

0 

95 

53 

EE 

42 

1 

62 

81 

XF . 

1 

0 

00 

13 

NA 


26 

1 

41 

72 

OC 


8 

0 

90 

51 

LE 

37 

l 

57 

79 






SA 


24 

1 

38 

71 

TC 


6 

0 

78 

45 

DE 

33 

l 

52 

77 






CA 


20 

1 

30 

67 

DC 

. 

4 

0 

60 

38 

CE 

32 

l 

51 

76 

MG . 

27 

1 

43 

73 

HA 


20 

1 

30 

67 

YC 


4 

0 

60 

38 

ME 

26 

l 

41 

72 

IG. 

19 

1 

28 

67 

PA. 


14 

1 

15 

61 

CC 


3 

0 

48 

33 

PE 

23 

l 

36 

70 

UG . 

8 

0 

90 

51 

WA 


12 

1 

08 

58 

HC 


3 

0 

48 

33 

WE 

22 

1 

34 

69 

RG . 

7 

0 

85 

48 

IA 


8 

0 

90 

51 

LC 


3 

0 

48 

33 

HE . 

20 

l 

Mil 

67 

AG 

6 

0 

78 

45 

GA. 


7 

0 

85 

48 

MC 


3 

0 

48 

33 

BE 

18 

l 

26 

66 

EG. 

4 

0 

60 

38 

OA 


7 

0 

85 

48 

uc 


3 

0 

48 

33 

GE . 

14 

l 

15 

61 

DG.. 

2 

0 

30 

25 

VA 


6 

0 

78 

45 

FC 

_ 

2 

0 

30 

25 

IE . 

13 

l 

11 

69 

OG . 

2 

0 

30 

25 

YA 


6 

0 

78 

45 

GC 


2 

0 

30 

25 

UE 

11 

l 


56 

SG... 

2 

0 

30 

25 

FA 


5 

0 

70 

42 

XC 


2 

0 

30 

25 

FE . 

10 

l 


55 

FG . 

1 

0 

00 

13 

UA 


5 

0 

70 

42 

KC 


1 

0 

00 

13 

YE . 

9 

0 

95 

53 

GG . 

1 

0 

00 

13 

BA 


4 

0 

60 

38 

PC 


1 

0 

00 

13 

KE 

6 

0 

78 

45 

LG. 

1 

0 

00 

13 

AA 


3 

0 

48 

33 







OE . 

3 

0 

48 

33 

TG 

1 

0 

00 

13 

XA 


2 

0 

30 

25 






1 

JE... 

2 

0 

30 

25 

YG . 

1 

0 

00 

13 

JA 


1 

0 

00 

13 

ED. 

m 

60 

1 

78 

88 

ZE .. 

2 

0 

30 

25 




1 


KA 


1 

0 

00 

13 

ND 

ur 

52 

1 

72 

85 

AE 

1 

0 


13 






ZA 


1 

0 

00 

13 

AD 


27 

1 

43 

73 

XE. 

1 

□ 


13 












RD. 


17 

1 

23 

64 






TH . 

78 

1 

89 

92 

AB 


6 

0 

78 

45 

OD 

. 

12 

1 

08 

[ 58 






SH. 

26 

1 

41 

72 

MB 


6 

0 

78 

45 

LD 

_ 

9 

0 

95 

53 


1 




GH . 

20 

1 

30 

67 

DB 


4 

0 

60 

38 

DD 


8 

0 

90 

51 

OF. 

25 

1 

gftl 

72 

CH. 

14 

1 

15 

61 

EB 


4 

0 

60 

38 

ID. 

... 

6 

0 

78 

45 

EF . 

18 

1 

26 

66 

EH . 

7 

0 

85 

48 

OB. 


4 

0 

60 

38 

TD 


6 

0 

78 

45 

SF . 

12 

1 


58 

NH 

4 

0 

60 

38 

LB. 


3 

0 

48 

33 

SD 


5 

0 

70 

42 

FF . 

11 

1 


66 

WH 

4 

0 

60 

38 

SB 


3 

0 

48 

33 

YD 


4 

0 

60 

38 

YF 

11 

1 

04 

56 

OH . 

3 

0 

48 

33 

TB 


3 

0 

48 

33 

UD 


3 

0 

48 

33 

IF 

10 

1 

eiifl 

55 

PH . 

3 

0 

48 

33 

UB 


3 

0 

48 

33 

HD 


2 

0 

30 

25 

NF 

9 

0 

95 

53 

RH . 

3 

0 

48x 

33 

IB 


2 

0 

30 

25 

CD 


1 

0 

00 

13 

DF 

8 

0 

90 

51 

AH . 

2 

0 

30 

25 

NB 


2 

0 

30 

25 

FD 


1 

0 

00 

13 

TF - 

7 

0 

85 

48 

DH 

2 

0 

30 

25 

RB 


2 

0 

30 

25 

GD 


1 

0 

00 

13 

RF. 

6 

0 

78 

45 

LH 

1 

0 

00 

13 

YB 


2 

0 

30 

25 

MD 


1 

0 

00 

13 

HF 

5 



42 

MH 

1 

0 

00 

13 

HB 


1 

0 

00 

13 

PD 


1 

0 

00 

13 

AF 

4 



38 

XH 

1 

0 

00 

13 

PB 

- 

1 

0 

00 

13 

XD 

- 

1 

0 

00 

13 | 

LF 

3 



33 j 

YH 

1 

0 

00 

13 
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Table 9-A, Contd —The k%8 different digraphs of Table 6-A, arranged first alphabetically accord¬ 
ing to their final letters and then according to their absolute frequencies, accompanied by the 
logarithms of their assigned probabilities 
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Table 9-A, Concluded —The 428 different digraphs of Table 6-A, arranged first alphabetict&ly 
according to their final letters and then according to their absolute frequencies, accompanied by 
the logarithms of their assigned probabilities 


. . F 


EH 

r 

L U (F) 

1jT4 

(2F) 

F 

Lio(r> 

I-4M 

<2F) 

r 

Lm(F> 

Lw 

(2F) 

ES_ 

54 

g 


86 

OT. 

19 

l 

28 


JU_. 

2 

0 

30 

25 

FW. 

1 

0 

00 

13 

AS. . 

41 

Q 


80 

TT 

19 

l 

28 

67 

LU_. 

2 

0 

30 

25 

YW— 

1 

0 

00 

13 

IS__ 

35 

a 


78 

DT_ 

15 

l 

18 

62 

MU . 

2 

0 

30 

25 






RS_ 

31 

i 

49 

75 

YT.. 

16 

l 

18 

62 

YU 

1 

0 

00 

13 

TY 


1 

i a 


NS.. 
SS . 

24 

19 

i 

i 

38 

28 

71 

67 

CT_ 

UT 

14 

12 

l 

l 

15 

08 

61 

58 

IV.. 

25 

1 

40 

72 

JLA_ 

EX... 

flY 

Xu 

7 

1 

X 

0 

A 

Xo 

85 

nn 

Da 

48 

TS. 

19 

i 

28 

67 

FT. 

11 

l 

04 

56 

EV . 

20 

1 

30 

67 

UA _ 

X 

u 

uu 

XO 

0S__ 

14 

i 

16 

61 

LT 

8 

0 

90 

51 

AV_. 

7 

0 

85 

48 






DS_ 

13 

i 

11 

59 

PT 

8 

0 

90 

51 

OV . 

7 

0 

85 

48 

TY._ 

41 

1 

61 

80 

US_.. 

12 

i 

08 

58 

XT 

7 

0 

85 

48 

RV 

5 

0 

70 

42 

AY... 

12 

1 

08 

58 

YS. _ 

11 

i 

04 

56 

GT 

4 

0 

60 

38 

DV. 

3 

0 

48 

33 

LY... 

10 

1 

00 

55 

LS._ 

6 

0 

78 

45 

MT 

2 

0 

30 

25 

NV . 

3 

0 

48 

33 

RY... 

9 

0 

96 

53 

PS. . 

6 

0 

78 

45 

BT 

1 

0 

00 

13 

LV 

2 

0 

30 

25 

BY .. 

7 

0 

85 

48 

HS_ 

4 

0 

60 

38 

VT. 

1 

0 

00 

13 

PV . 

1 

0 

00 

13 

NY_ 

5 

0 

70 

42 

MS _ 

4 

0 

60 

38 






SV... 

1 

0 

00 

13 

ET.. 

4 

0 

60 

38 

FS_ 

3 

0 

48 

33 

OU. 

37 

1 

67 

79 

UV . 

1 

0 

00 

13 

MY_ 

2 

0 

30 

25 

GS. . 

3 

0 

48 

33 

QU . 

. 15 

1 

18 

62 




1 

1 

OY... 

2 

0 

30 

25 

BS „ 

1 

0 

00 

13 

AU 

13 

1 

11 

59 

TW.. 

36 

1 

56 

78 

CY_ 

1 

0 

00 

13 

CS _ 

1 

0 

00 

13 

SU. 

11 

1 

04 

• 56 

OW.. 

8 

0 

90 

51 

DY.._ 

1 

0 

00 

13 

KS 

1 

0 

00 

13 

HU . 

8 

0 

90 

51 

EW_. 

7 

0 

85 

48 

FY_ 

1 

0 

00 

13 

WS _ 

1 

0 

00 

13 

NU _ 

7 

0 

85 

48 

DW . 

4 

0 

60 

38 

HY. _ 

1 

0 

00 

13 

xs _ 

1 

0 

00 

13 

DU . 

5 

0 

70 

42 

RW__ 

4 

0 

60 

38 

PY_ 

1 

0 

00 

13 






RU . 

5 

0 

70 

42 

SW... 

4 

0 

60 

38 

SY.. 

1 

0 

00 

13 

NT 

82 

1 

91 

93 

TU . 

5 

0 

70 

42 

AW . 

3 

0 

48 

33 

WY.._ 

1 

0 

00 

13 

ST_ 

63 

1 

80 

88 

CU.. 

4 

0 

60 

38 

NW_ 

3 

0 

48 

33 






AT_ 

47 

1 

67 

83 

EU . 

3 

0 

48 

33 

LW_. 

2 

0 

30 

25 

T7 

n 

A 

on 

OK 

RT_ 

42 

1 

62 

81 

FU . 

3 

0 

48 

33 

CW... 

1 

0 

00 

13 

Xju 

t?*7 

a 

1 

u 

A 

oU 

nn 

1 Q 

ET_ 

37 

1 

67 

79 

PU 

3 

0 

48 

33 

FW. . 

1 

0 

00 

13 

till .1 . . 

m«7 

X 

1 

u 

n 

UU 

A A 

• XO 
1 Q 

HT . 

28 

1 

45 

74 

BU._ 

2 

0 

30 

25 

GW... 

1 

0 

00 

13 

X4J _ . 

1 

u 

UU 

XO 

IT _ 

27 

1 

43 

73 

GU__ 

2 

0 

30 

25 

HW . 

i! 

i 

0 

00 

13 | 

5,i 

ooo 





2-24 


RESTRICTE D 









REF ID:A56892 


nEOTRICTEB - 


i 


Table 9-B —The 18 digraphs composing 25% of the 5,000 digraphs of Table 6-A, accompanied by 
the logarithms of their assigned probabilities, arranged alphabetically according to their final letters 


(1) AND ACCORDING TO THEIR INITIAL (2) AND ACCORDING TO THEIR ABSOLUTE 
LETTERS FREQUENCIES 


F 

Ii»(F) 

La4 
( 2 r) 

F 

Lio(F) 

m&M 

F 

L»(F) 

■usm 

■•J >■ 

F 

Luff’) 

(2F) 

ED... 60 

1 78 

88 

IN.. 75 

1 88 

—«— 

92 

ED.. 60 

1 78 

88 

IN . 75 

1 88 

92 

ND. 52 

1 72 

85 

ON.. 77 

1 89 

92 

ND... 52 

1 72 

85 

AN— 64 

1 81 

89 

NE. 57 

1 76 

87 

TO.. 50 

1 70 

84 

RE. 98 

1 99 

96 

TO .. 50 

1 70 

84 

RE... 98 

1 99 

96 




TE... 71 

1 85 

91 




SE_49 

1 69 

84 

ER-. 87 

1 94 

94 

NE... 57 

1 76 

87 

ER... 87 

1 94 

94 

TE . 71 

1 85 

91 

OR._ 64 

1 81 

89 

VE . 57 

1.76 

87 

OR. 64 

1 81 

89 

VE_. 57 

1 76 

87 




SE.. 49 

1 69 

.84 







ES_64 

1 73 

86 




ES . 64 

1 73 

86 

TH... 78 

1 89 

92 

NT. . 82 

1 91 

93 

TH . 78 

1 89 

92 

NT_82 

1 91 

93 




ST... 63 

1 80 

88 


i 


ST.... 63 

1 80 

88 

AN.... 64 

1 81 

89 

1,249 



EN. Ill 

2 05 

99 

1,249 



EN ...111 

2 05 

99 




ON. 77 

1 89 

92 


i 

1 



Table 9-C — The 58 digraphs composing 50% of the 5,000 digraphs of Table 6-A, accompanied by 
the logarithms of their assigned probabilities, arranged alphabetically according to their final letters 


(1) AND ACCORDING TO THEIR INITIAL LETTERS 
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Tablh 9-C, Concluded —The 58 digraphs composing 50% of the 5,000 digraphs of Table 6-A, 
accompanied by the logarithms of their assigned probabilities, arranged alphafoeticaU/y according to 
their final letters 


<2) AND ACCORDING TO THEIR ABSOLUTE FREQUENCIES 


F 

Luff) 

n 

I 

F 

Ltt(F> 


F 

Lw<n 


F 

Lw(T) 

(2$ 

RA„ 39 

BPS 

80 

EE.. 42 

M 

81 

EN..1U 

2 06 

99 

ES_64 

IBI 

.86 

MA86 

Uli 

78 

LE_. 37 


79 

ON . 77 

1 89 

92 

AS .„ 41 


80 

EA._ 36 

1 64 

78 

DE._ 33 

1 62i 

77 

IN _ 76 

1 88 

92 

IS.. 86 


78 

DA.. 82 

1 61 

76 

CE . 32 

1 61 

76 

AN 64 

1.81 

.89 

RS. 31 

1 49 

76 

LA _ 28 

1 46 

.74 










TA 4 . 28 

1 46 

74 

TH . 78 

1 89 

92 




NT-. 82 

1.91 

93 







TO . 60 

1 70 

84 

ST_. 63 

1 80 

88 

EC. . 32 

1 61 

76 




CO . 41 

1 61 

.80 

AT. . 47 

1 67 

83 




TI 46 

1 65 

82 

10 . 41 

1 61 

80 

RT.. 42 

1.62 

81 

ED .. 60 

1 78 

88 

FI.. 89 

1 69 

80 

FO _ 40 

1 60 

80 

ET.. 87 

1 67 

79 

ND.. 62 

1 72 

.86 

SI.. 34 

i 53 

77 

RO . 28 

1 46 

74 

HT_28 

1 46 

74 




HI.. 33 

1 62 

77 







RE.. 98 

1 99 

96 

HI.. 30 

1 48 

76 




OU „ 87 

1 57 

79 

TE._ 71 

1 86 

.91 

RI.. 30 

1 48 

76 

ER 87 

1 94 

94 

TW-. 36 

1.56 

78 

NE... 67 

1 76 

87 




OR _ 64 

1 81 

89 




VE .. 67 

1 76 

87 

AL. 32 

1 61 

76 

AR . 44 

1 64 

.82 

TY__ 41 

1 61 

.80 

SE . 49 

1.69 

84 

EL _ 29 

1 46 

74 

UR. 31 

1 49 

76 

2,496 




Table 9-D —The 122 digraphs composing 75% of the 5,000 digraphs of Table 6-A, accompanied by 
the logarithms of their assigned probabilities, arranged alphabetically according to their final letters 
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Table 9-D, Conld —The 122 digraphs composing 75% of the 5,000 digraphs of Table 6-A, ac¬ 
companied by the logarithms of their assigned probabilities, arranged alphabetically according to 
their final letters 


(1) AND ACCORDING TO THEIR INITIAL LETTERS—Concluded 



(2) AND ACCORDING TO THEIR ABSOLUTE FREQUENCIES 


(2F) 


L»(P) 


Li 0 (r) 


L"h 

(237) 


Ljo(F) 


“EmT 

(2T) 


Lio(r> 


Lm 

(2F) 


RA. 

MA 

EA„ 

DA 

LA. 

TA. 

NA.. 

SA. 

CA. 

HA. 

PA. 

EC. 
IC 
NC 
AC. 

ED. 
ND. 
AD 
RD 


39 

36 

35 

32 

28 

28 

26 

24 

20 

20 

14 

32 

22 

19 

14 

60 

52 

27 

17 


59 

56 

54 

51 

45 

45 

41 

38 

30 

30 

15 


1 51 
1 34 
1 28 
1 15 

,1 78 
1 72 
1 43 
1 23 


80 

78 

78 

76 

74 

74 

72 
71 
67 
67 
61 

76 

69 

67 

61 

88 

85 

73 
64 


RE.. 

TE 

NE 

VE 

SE. 

EE 

LE. 

DE. 

CE. 

ME 

PE 

WE. 

HE 

BE 

GE 

IE.. 

OF 

EF. 

NG. 

IG 


98 

71 

57 

57 

49 

42 

37 

33 

32 

26 

23 

22 

20 

18 

14 

13 

25 

18 

27 

19 


99 

85 

76 

76 

69 

62 

57 

52 

51 

41 

36 

34 

30 

26 

15 

11 


1 40 
1 26 

1 43 
1 28 


96 

91 

87 

87 

84 

81 

79 

77 

76 

72 

70 

69 

67 

66 

61 

59 

72 
66 

73 
67 


TH. 

SH 

GH. 

CH 

TI 

FI. 

SI. 

HI. 

NI 

RI 

DI 

El. 

LI 

AI.. 

AL 

EL 

LL. 

IL. 

OL. 


78 

26 

20 

14 

45 

39 

34 

33 

30 

30 

27 

27 

20 

17 

32 

29 

27 

23 

19 


1 89 
1 41 
1 30 
1 15 


65 

59 

53 

52 

48 

48 

43 

43 

30 

23 


1 51 
1 46 
1 43 
1 36 
1 28 


92 

72 
67 
61 

82 

80 

77 

77 

76 

75 

73 

73 
67 
64 

76 

74 
73 
70 
67 


OM. 
AM 

EM. 

EN. . 

ON. 
IN 
AN 
UN. 

TO¬ 
CO. 
10. 
FO. 
RO . 
HO. 
WO. 
NO. 
PO 
DO 
SO. 
LO. 


. 25 
. 14 
. 14 

111 
. 77 
. 75 
. 64 
. 21 

. 50 
. 41 
. 41 
. 40 
. 28 
. 20 
. 19 
18 
17 
. 16 
. 15 
. 13 


1 40 
1 15 

1 15 

2 05 
1 89 
1 88 
1 81 
1 32 


70 

61 

61 

60 

45 

30 

28 

26 

23 

20 

18 

11 


72 

61 

61 

99 

92 

92 

89 

68 

84 

80 

80 

80 

74 

67 

67 

66 

64 

63 

62 

59 
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Table 9-D, Concluded —The 122 digraphs composing 75% of the 5,000 digraphs of Table 6-A, 
accompanied by the logarithms of their assigned probabilities, arranged alphabetically according to 
their final letters 

(2) AND ACCORDING TO THEIR ABSOLUTE FREQUENCIES—Concluded 


F 

L m (F) 

I>U4 

(2H 

F 

Lio(F) 

Lei 

(2F) 

1 F 

LiflCF) 

Lw 

(2F) 

F 

L»(F) 

LlU 

(2F) 

OP„_ 26 

1 40 

72 

ES.. 54 

IBS 

86 


1 91 

93 

OU. 37 

1 67 

79 

EP_. 20 

1 80 

67 

AS.. 41 

IH«il 

80 


1 80 

88 

QU . 16 

1 18 

62 




IS. 85 


78 

AT . 47 

1 67 

83 

AU_. 18 

1 11 

59 




RS_ 81 

1 49 

75 

RT. 42 

1 62 

81 

Ttr OE 


70 

ER . 87 

1 94 

94 

NS . 24 

1 38 

71 

ET 87 

1 57 

79 

XV_ £iD 

mr on 

1 4U 

1 OA 

1 Ok 

arj 

OR64 

1 81 

89 

SS _ 19 

1 28 

67 

HT . 28 

1 45 

74 

CiV — ct\J 

1 oU 

0 1 

AR. 44 

1 64 

82 

TS. 19 

1 28 

67 

IT 27 

1 43 

73 

Tff .„ 36 

1 56 

.78 

UR... 81 

1 49 

75 

OS 14 

1 15 

61. 

OT . 19 

1 28 

67 

TV 1 K 

1 1Q 

AO 

1R.. 27 

1 48 

78 

DS_ 18 

1 11 

59 

TT . 19 

1 28 

67 

XA- .. la 

X lo 


PR.. 18 

1 26 

66 




DT . 15 

1 18 

62 

TY. 41 

1 61 

80 

HR._ 17 

1 28 

64 




YT . 15 

1 18 

62 

3,746 



TR_17 

1 23 

64 




CT . 14 

1 15 

61 



1 


Table 9-E —All the 428 different digraphs of Table 6-A, arranged alphabetically first according to 
their final letters and then according to their initial letters 
(SEE TABLE 6-A—READ DOWN THE COLUMNS) 


Table 10-A —The 56 trigraphs appearing 100 or more times m the 50,000 letters of Governmental 
plain-text telegrams, arranged according to their absolute frequencies, accompanied by the loga¬ 
rithms of their assigned probabilities 


F 

LioCF) 

Liu 

(2F) 

F 

1 LioCF) 1 


F 

lao(D 

Lm 

(21?) 

ENT_ 

569 

2 

76 

99 

TOP_ 

. 174 

2 

24 

82 

EIG_ 

135 

2 

13 

79 

ION_ 

260 

2 

41 

88 

NTH... 

_ 171 

2 

23 

82 

FIV_ 

135 

2 

13 

79 

AND_ 

228 

2 

36 

86 

TWE_ 

170 

2 

23 

82 

MEN. ... 

131 

2 

12 

78 

ING_ 

226 

2 

35 

86 

TWO ... 

. 163 

2 

21 

81 

SEV. .. ._ 

131 

2 

12 

78 

IVE 

225 

2 

35 

86 

ATI.. . 

. 160 

2 

20 

81 

ERS.. .. 

126 

2 

10 

78 

TIO_ 

221 

2 

34 

85 

THR_ 

._ 158 

2 

20 

81 

UND.... 

125 

2 

10 

78 

FOR . 

218 

2 

34 

85 

NTY. ~ 

.. 157 

2 

20 

81 

NET.. . 

118 

2 

07 

77 

OUR.. 

211 

2 

32 

85 

HRE -. 

. 153 

2 

18 

80 

PER ... . 

115 

2 

06 

76 

THI... _. 

211 

2 

32 

85 

WEN_ 

._ 153 

2 

18 

80 

STA_ 

116 

2 

06 

76 

ONE_ 

210 

2 

32 

85 

FOU ... 

_. 152 

2 

18 

80 

TER . .. 

1J'» 

2 

05 

76 

NIN 

207 

2 

32 

85 

ORT. .. 

_ 146 

2 

16 

80 

EQU . _ 

lil 

2 

06 

76 

STO .... 

202 

2 

31 

84 

REE 

146 

2 

16 

80 

RED.. . 

113 

2 

05 

76 

EEN. 

196 

2 

29 

84 

SIX 

. 146 

2 

16 

80 

TED . . 

112 

2 

05 

76 

GHT_ 

196 

2 

29 

84 

ASH . 

. 143 

2 

16 

80 

ERI_. 

109 

2 

04 

76 

INE._ . 

192 

2 

28 

83 

DAS __ 

_. 140 

2 

16 

79 

HIR 

106 

2 

03 

75 

VEN . . 

190 

2 

28 

83 

IGH . 

140 

2 

16 

79 

IRT .. 

105 

2 

02 

75 

EVE.. _ 

177 

2 

25 

82 

ERE_ 

. 138 

2 

14 

79 

DER 

101 

2 

00 

74 

EST_ 

176 

2 

26 

82 

COM. . 

136 

2 

13 

79 

DRE - - 

100 

2 

00 

74 

TEE 

174 

2 

24 

82 

ATE _ 

. 135 

2 

13 

79 
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Tabld 10-B —The SO tngrapks appearing 100 or more times in the 80,000 letters of Governmental 
plain-text telegrams, arranged first alphabetically according to their initial letters, and then 
according to their absolute frequencies, accompanied by the logarithms of their assigned pi obabilities 



Tabus 10-C —The 56 trigraphs appearing 100 or more times m the 50,000 letters of Governmental 
plain-text telegrams, arranged first alphabetically according to their centred letters, and then ac¬ 
cording to their absolute frequencies, accompanied by the logarithms of their assigned probabilities 
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Table 10-C, Concluded —The 56 tngrapks appearing 100 or more times in the 50,000 letters of 
Governmental plain-text telegrams, arranged first alphabetically according to their central letters, 
and then according to their absolute frequencies, accompanied by the logarithms of then assigned 
probabilities 


F 

Lw<r) 

Lin 

(2D 

r 

La>CT) 

(2D 

F 

LuOF) 

~G7 

(2D 

EQU_ ^ 

114 

2 06 

76 

EST 

. 176 

2 25 

82 

OUR . .. 

211 

2 82 

85 





ASH 

.... 143 

2 16 

80 





HRE. 

153 

2 18 

80 









0RT_. 

146 

2 16 

80 

STO 

_ . 202 

2 81 

84 

IVE_ 

225 

2 35 

86 

ERE. 

138 

2 14 

79 

NTH 

. 171 

2 23 

82 

EVE.. . 

177 

2 25 

82 

ERS .... 

126 

2 10 

78 

ATI 

160 

2 20 

81 





ERI. 

109 

2 04 

76 

NTY 

157 

2 20 

81 





IRT .... 

105 

2 02 

75 

ATE 

.. 135 

2 18 

79 

TWE.. 

170 

2 28 

82 

DRE . ... 

100 

2 00 

74 

STA 

115 

2 06 

76 

TWO .. . 

163 

2 21 

81 


Table 10-D —The 56 tngraphs appearing 100 or more times in the 50,000 letters of Governmental 
plain-text telegrams, arranged first alphabetically according to their final letters, and then according 
to their absolute frequencies, accompanied by the logarithms of their assigned probabilities 


F j 

Lu(F) | 

Diu 

(2F) 

STA_.. 

... 116 

2 

06 

76 

AND.... 

... 228 

2 

36 

86 

UND.. _ 

.. 125 

2 

10 

78 

RED... . 

. 113 

2 

05 

76 

TED.... 

... 112 

2 

05 

76 

IVE.... 

226 

2 

85 

86 

ONE.... 

_. 210 

2 

32 

85 

INE... 

... 192 

2 

28 

83 

EVE... 

... 177 

2 

25 

82 

TEE_ . 

.. 174 

2 

24 

82 

TWE... 

170 

2 

23 

82 

HRE . 

. 163 

2 

18 

80 

REE... 

.. 146 

2 

16 

80 

ERE.. 

.. 138 

2 

14 

79 

ATE.. _ 

.. 135 

2 

13 

79 

DRE... 

. 100 

2 

00 

74 

ING. . 

... 226 

2 

35 

86 

EIG_.. 

... 135 

2 

13 

79 

NTH_ 

.. 171 

2 

23 

82 

ASH. . 

. 143 

2 

16 

80 

IGH . 

.. 140 

2 

u 

79 


F 

Lk»(F) 

Dim 

(2F) 

THI. . 211 

2 32 

85 

ATI.. . 160 

2 20 

81 

ERI.. . 109 

2 04 

76 

COM. _ 136 

2 13 

79 

ION_ . . 260 

2 41 

88 

NIN— __ 207 

2 32 

85 

EEN~ 196 

2 29 

84 

VEN. . 190 

2 28 

83 

WEN .. 153 

2 18 

80 

MEN .. 131 

2 12 

78 

TIO._ .. 221 

2 34 

85 

STO . .. 202 

2 31 

84 

TWO. . 163 

2 21 

81 

TOP. .. . 174 

2 24 

82 

FOR. _ 218 

2 34 

85 

OUR . 211 

2 32 

85 

THR 158 

2 20 

81 

PER . . 115 

2 06 

76 


F 

L»(F) 

1*4 

(2F) 

TER_115 

2 06 

76 

HIR . - 106 

2 03 

76 

DER . . 101 

2 00 

74 

DAS . ... 140 

2 15 

79 

ERS.. . . 126 

2 10 

78 

ENT. 669 

2 76 

99 

GHT. . ._ 196 

2 29 

84 

EST ~ 176 

2 25 

82 

ORT . .. 146 

2 16 

80 

NET . . 118 

2 07 

77 

IRT. .. 105 

2 02 

75 

FOU. . .. 162 

2 18 

80 

EQU _ 114 

2 06 

.76 

FIV 135 

2 13 

79 

SEV ~ 131 

2 12 

78 

SIX - 146 

2 16 

80 

NTY . . 157 

2 20 

81 
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Table 11-A —The 5k tetragrapks appcan 
plan-text telegrams, arranged according 
of their assigned probabilities 


ng BO or more times m the 50,000 letters of Governmental 
to their absolute frequencies, accompanied by the logarithms 


ZJ 

Lk>(F) | 

lau 

.... y 1 

Lio(F) 

Ism 

<2F) 

(2D 

TION. .. 

218 

2 

34 

99 

THIR ... 

104 

2 

02 

87 

EVEN .. 

168 

2 

23 

95 

EENT .. 

102 

2 

01 

87 

TEEN 

163 

2 

21 

94 

REQU 

98 

1 

99 

86 

ENTY. ... 

161 

2 

21 

94 

HIRT.. 

97 

1 

99 

86 

STOP... 

154 

2 

19 

93 

COMM . 

93 

1 

97 

85 

WENT. . 

153 

2 

18 

93 

QUES .. 

87 

1 

94 

84 

NINE .. 

153 

2 

18 

93 

UEST.. . 

87 

1 

94 

84 

TWEN ... . 

152 

2 

18 

93 

EQUE . 

86 

1 

93 

84 

THRE 

149 

2 

17 

93 

NDRE . .. 

77 

1 

89 

82 

FOUR . 

144 

2 

16 

92 

OMMA. .. 

71 

1 

85 

81 

IGHT .. 

140 

2 

15 

92 

LLAR. . 

71 

1 

85 

81 

FIVE .. 

135 

2 

18 

91 

OLLA.. . 

70 

1 

85 

81 

HREE .. . 

134 

2 

13 

91 

VENT.. .. 

70 

1 

85 

81 

DASH .. . 

132 

2 

12 

91 

DOLL . . 

68 

1 

83 

80 

EIGH . 

132 

2 

12 

91 

LARS . . 

68 

1 

83 

80 

SEVE ... 

121 

2 

08 

89 

THIS 

68 

1 

83 

80 

ENTH_ 

114 

2 

06 

89 

PERI ... 

67 

1 

83 

80 

MENT. 


2 

05 

88 

ERIO. 

66 

1 

82 

80 

Table 11-B —The 5k tetragrapks appearing 50 or more tv 

S 

A 

09 

0 


ASHT. 

HUND 

DRED. 

RIOD. 

IVED 

ENTS 

FFIC 

FROM. 

IRTY. 

RTEE. 

UNDR. 

NAUG. 

OURT. 

UGHT. 

STAT. 

AUGH. 

CENT 

FICE 


plain-text telegrams, arranged first alphabetically according to their initial 
cording to thevr absolute frequencies, accompanied by the logarithms of their < 


l»cf) I ka 


* L«(T) ( 2D 

64 1 81 .79 
64 1 81 79 
63 1 80 79 
63 1 80 .79 
62 1 79 78 
62 1 79 78 
62 1 79 78 
69 1 77 78 
69 1 77 78 
59 1 77 78 
69 1 77 78 
66 1 76 77 
56 1 75 .77 
56 1 75 77 
54 1 73 t 6 
52 1 72 76 
52 1 72 76 
50 1 70 


of Governmental 
’s, and then aj> 
ied probabilities 


ASHT a _ 

AUGH_ .. 


6411 81 


DASH _.. 
DOLL. _. 
DREb. . 

EVEN__ 

ENTY_ 

EIGH.. .. 
ENTH 
EENT ... 

EQUE_ 

ERIO.. .. 
ENTS_ 


132 2 12 
68 1 83 


114 2 06 
102 2 01 


FOUR.. . 

FIVE. 

FFIC_ 

FROM. .. 
FICE. 


p 

Lio(F) 

HREE .. 134 

2 13 

HIRT. .. 97 

1 99 

HUND .... 64 

1 81 

IGHT. . 140 

2 15 

IVED ... 62 

1 79 

IRTY._ 69 

1 77 

LLAR ._ 71 

1 85 

LARS . 68 

1 83 

MENT_111 

2 05 

NINE . 163 

2 18 

NDRE.. .. 77 

1 89 

NAUG_ 56 

1 76 

OMMA__ 71 

1 85 

OLLA.. .. 70 

1 85 

OURT_ 56 

1 75 

PERI_ 67 

1 83 

QUES... . 87 

1 94 


P 

Lio(D 

ZjU4 

(2D 

REQU. . .. 98 

1 99 

$6 

RIOD_ 63 

1 80 

.7i9 

RTEE. 59 

1 77 

78 

I 

STOP.. ._ 154 

2 19 

98 

SEVE.... 121 

2 08 

89 

STAT. . . 64 

1 73 

76 

TION . 218 

2 34 

99 

TEEN.. _ 163 

2 21 

94 

TWEN_152 

2 18 

93 

THRE. 149 

2 17 

93 

THIR. 104 

2 02 

87 

THIS . . 68 

1 83 

80 

UEST_ 87 

1 94 

84 

UNDR... 59 

1 77 

78 

UGHT ... 56 

1 75 

77 

VENT .. 70 

1 85 

81 

WENT. 153 

2 18 

93 
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Table 11-C —The 5k letragraphs appearing 50 or more times in the 50,000 letters of Governmental 
plain-text telegrams, arranged first alphabetically according to their second letters, and then ac¬ 
cording to their absolute frequencies, accompanied by the logarithms of their assigned probabilities 


F 

I/io(F) 

Liu 

<2F) 

F 

DASH. 

132 

2 

12 

91 i 

TION. . 

218 

LARS_ 

68 

1 

83 

80 

NINE_ 

153 

NAUG. 

66 

1 

76 

77 

FIVE_ . 

135 






EIGH. .. 

132 

NDRE_ 

77 

1 

89 

82 

HIRT_ 

97 

TEEN_ 

163 

2 

21 

94 

RIOD . . 

63 

WENT_ 

163 

2 

18 

93 

FICE . 

50 

SEVE 

121 

2 

08 

89 



MENT. 

111 

2 

05 

88 

LLAR 

71 

EENT. 

102 

2 

01 

87 

OLLA . 

70 

REQU 

98 

1 

99 

86 



UEST_ 

87 

1 

94 

84 

OMMA 

71 

VENT__ 

70 

1 

85 

81 



PERI_ 

67 

1 

83 

80 

ENTY . 

161 

CENT_ 

62 

1 

72 

76 

ENTH. 

114 






ENTS. . 

62 

FFIC... - 

62 

1 

79 

78 

UNDR. . 

59 

IGHT_ 

140 

2 

16 

92 



UGHT_ 

56 

1 

75 

77 

FOUR 

144 






COMM . . 

93 

THRE. 

149 

2 

17 

93 

DOLL... . 

68 

THIR 

104 

2 

02 

87 



THIS „. . 

68 

1 

83 

80 

EQUE. 

86 


I*w(F) 

Lm 

(2D 

F 

Lu(F) 

Lut 

(2D 

2 34 

99 

HREE 134 

2 13 

91 

2 18 

93 

ERIO_ 66 

1 82 

80 

2 13 

91 

DRED... 63 

1 80 

79 

2 12 

91 

FROM_ 69 

1 77 

78 

1 99 

86 

IRTY... 59 

1 77 

78 

1 80 

79 




1 70 

75 

ASHT_ 64 

1.81 

79 

1 85 

81 

STOP. 154 

2 19 

93 

1 85 

81 

RTEE. . 69 

1 77 

78 

1 85 

81 

STAT_ 64 

1 73 

76 

9 5>1 

Qi 

QUES. _. 87 

1 94 

84 

Li al 

o HA 

OQ 

HUND_ 64 

1 81 

79 

a UD 
1 70 

no 

OURT_ 56 

1 76 

77 

X (u 

1 77 

1 o 

78 

AUGH. . 52 

1 72 

76 

2 16 

92 

EVEN .. 168 

2 23 

95 

1 97 

85 

IVED... 62 

1 79 

78 

1 83 

80 






TWEN_152 

2 18 

93 

1 93 

84 





Tablb 11-D —The 5k tetragraphs appearing 50 or more times m the 50,000 letters of Governmental 
plain-text telegrams, arranged first alphabetically according to their third letters, and then according 
to their absolute frequencies, accompanied by the logarithms of their assigned probabilities 


F 

Lio(F) 

Ltt4 

(2F) 

F 

Lm(F) 

L.224 

(2F) 

F 

Lm(F) 

Irttt 

(2D 

LLAR .. 71 

STAT_ 54 

FICE_ 50 

UNDR . .. 69 

EVEN. 168 

TEEN_163 

TWEN.. .. 152 
HREE134 

QUES_ 87 

DRED_ 63 

IVED. . 62 

RTEE.. .. 59 

1 85 
1 73 

1 70 

1 77 

2 23 
2 21 
2 18 
2 13 
1 94 
1 80 
1 79 
1 77 

81 

76 

.76 

78 

95 

94 

93 

91 

84 

79 

78 

78 

EIGH_132 

AUGH. .. 52 

IGHT_140 

ASHT .._ 64 

UGHT_ ._ 66 

THIR .- 104 
THIS .68 
ERIO ... 66 

FFIC... . 62 

OLLA 70 

DOLL_68 

2 12 

1 72 

2 15 
1 81 

1 75 

2 02 
1 83 
1 82 
1 79 

1 85 
1 83 

91 

76 

92 

79 

77 

87 

80 

80 

78 

81 

80 

COMM . . 93 

OMMA . . 71 

WENT ... 163 
NINE .. 153 
MENT ... Ill 

EENT_102 

VENT _. . 70 

HUND _. . 64 
CENT . 52 

TION -- 218 
STOP — 154 
RIOD .. 63 

FROM . 59 


85 

81 

93 

93 

88 

87 

81 

79 

76 

99 

93 

79 

78 
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Table 11-D, Concluded —The Bit tetragrapks appearing BO or more times m the 50,000 letters of 
Governmental plain-text telegrams, arranged first alphabetically according to their third letters, and 
then according to their absolute frequencies, accompanied by the logarithms of their assigned 
probabilities 


F 

Lio(T) 

Lm 

(2F) 

F 

Lio(F) 

Lrt 

< 2 r> 

r 

Lio(F) 

tint 

(2D 

REQU.... 

98 

1 99 

86 

DASH. . . 

132 

2 12 

91 

FOUR. . 

144 

2 16 

92 





UEST.. . 

87 

1 94 

84 

EQUE. 

86 

1.93 

84 

THRE . 

149 

2 17 

93 





NAUG.... 

66 

1 76 

77 

HIRT 

97 

1 99 

86 









NDRE .. 

77 

1 89 

82 

ENTY. 

161 

2 21 

94 





LARS. .. 

68 

1 83 

80 

ENTH.. . 

114 

2 06 

89 





PERI. . . 

67 

1 83 

80 

ENTS .. 

62 

1 79 

78 

FIVE ... 

135 

2 13 

91 

OURT 

56 

1 75 

77 

IRTY. .. 

59 

1 77 

78 

SEVE.. .. 

121 

2 08 

89 


Table 11-E —The 5U tetragraphs appearing 50 or more times m the 50,000 letters of Governmental 
plain-text telegrams, arranged first alphabetically according to their final letters, and then according 
to their absolute frequencies, accompanied by the logarithms of tlrnr assigned probabilities 


F 

ZjuCD 

Lnt 

(2D 

OMMA .. 71 

IlSfl 

81 

OLLA . . 70 

1m 

81 

FFIC.... 62 

1 79 

78 

HUND .. . 64 

1 81 

79 

DRED... 63 

1 80 

79 

RIOD. 63 

1 80 

79 

IVED. 62 

1 79 

78 

NINE. . 153 

2 18 

93 

THRE_149 

2' 17 

93 

FIVE. 135 

2 13 

91 

HREE ... 134 

2 13 

91 

SEVE . 121 

2 08 

89 

EQUE ... 86 

1 93 

84 

NDRE . 77 

1 89 

82 

RTEE ._ 59 

1 77 

78 

FICE .. 50 

1 70 

75 

NAUG ... 56 

1 75 

77 


r 

Lu.cn 

Lan 

(2D 


DASH . 132 

2 12 

91 

QUES.... 

EIGH . 132 

2 12 

91 

THIS. 

ENTH . 114 

2 06 

89 

LARS. 

AUGH . 52 

1 72 

76 

ENTS. 

PERI 67 

1 83 

80 

WENT_ 

DOLL 68 

1 83 

80 

IGHT . 
MENT. 

COMM . 93 

1 97 

85 

EENT. .. 

FROM . 69 

1 77 

78 

HIRT.... 
UEST ... 

TION . . 218 

2 34 

99 

VENT. 

EVEN. . 168 

2 23 

95 

ASHT 

TEEN . 163 

2 21 

94 

OURT.... 

TWEN 152 

2 18 

93 

UGHT.... 

ERIO . 66 

1 82 

80 

STAT. 

CENT. 

STOP ... 154 

2 19 

93 


FOUR .. 144 

2 16 

92 

REQU. 

THIR .. 104 

2 02 

87 


LLAR. .. 71 

1 85 

8i 

ENTY_ 

UNDR ._ 59 

1 77 

78 

IRTY ... 


F 

1 LioCD 

Lnt 

(2D 

87 

1 94 

84 

68 

1 83 

80 

68 

1 83 

80 

62 

1 79 

78 

153 

2 18 

93 

140 

2 15 

92 

111 

2 05 

88 

102 

2 01 

87 

97 

1 99 

86 

87 

1 94 

84 

70 

1 85 

81 

64 

1 81 

79 

56 

1 75 

77 

66 

1 75 

77 

54 

1 73 

76 

52 

1 72 

76 

1 

98 

1 99 

86 

161 

2 21 

94 

69 

1 77 

78 
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Table 12 — Average length of words and messages 


Number of 
letters m 
word 

X 

Number of 
times a;-letter 
word appears 

Number of 
letters 

i 

378 

378 

2 

973 

1,946 

3 

1,307 

3,921 

4 

1,635 

6,540 

5 

1,410 

7,050 

6 

1,143 

6,858 

7 

1,009 

7,063 

8 

717 

5,736 

9 

476 

4,284 

10 

274 

2,740 

11 

161 

1,771 

12 

86 

1,032 

13 

23 

299 

14 

23 

322 

15 

4 

60 


9,619 

50,000 


(1) Average length of words.........___.5 2 letters 

(2) Average length of messages........217 letters 

(3) Modal (most frequent) length-----105-114 letters 


(4) It is extremely unusual to find five consecutive letters without at least one vowel 
(6) The average number of letters between vowels is two 
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Tacld 13 — Checkerboard individual frequencies 1 

[Bfisod on a count of D 000 digraphs] 


Pi Cl 


A 

B 

C 

D 

E 

244 

226 

375 

394 

197 

F 

6 

H 

I J 

K 

125 

98 

193 

271 

95 

L 

M 

N 

0 

P 

229 

199 

188 

350 

251 

Q 

R 

S 

T 

U 

148 

162 

258 

427 

295 

V 

ff 

X 

Y 

Z 

42 

12 

34 

91 

97 

212 

317 

358 

308 

249 

A 

B 

C 

D 

E 

120 

108 

216 

256 

85 

F 

G 

H 

I J 

K 

216 

140 

152 

435 

269 

L 

M 

N 

0 

P 

206 

121 

306 

364 

284 

Q 

R 

S 

T 

U 

38 

29 

21 

147 

43 

V 

W 

X 

Y 

Z 


C, P, 


1 The numbers m the Ci C 2 squares represent the frequency of the individual components of the cipher digraph 
used to replace a given Pi P 2 digraph in accordance with a digraphic checkei board system where Pi and P» are the 
plam-text squares 
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Table 14 — Relative logarithmic values of frequencies of English digraphs 

{Based on a count of 5,000 digraphs. To obtain logarithm to baso 10 (Log 10) divide by 100] 


Second Letter 



A 

B 

c 

D 

E 

F 

G 

H 

I 

J 

K 

L 

M 

N 

0 

P 

Q 

R 

s 

T 

u 

V 

W 

X 

Y 

z 

A 

48 

i 

115 

143 

B 

|Q 

78 

B 

123 



161 

116 

181 


B 

* 

164 

161 

167 

111 

86 

48 

sk 


sk 

B 

60 

>k 

A 

* 

126 

* 

>k 

* 


i 

* 

78 

B 

* 

B 

* 

sk 



B 

B 

sk 

>k 

A 

85 

sk 

C 

1 

* 

48 

EE 

151 

12 

* 

116 

85 

* 

m 




161 

* 

A 



116 


sk 


sk 

i 

sk 

D 

161 

60 

m 


162 

| 

| 

B 

143 


* 

48 


B 


70 

B 


111 

118 

B 

48 


sk 

E 

sk 

E 

164 


151 

178 

162 

126 

| 

85 

143 

i 


146 

116 

B 


B 


194 

173 

167 

48 

130 

85 

85 

60 


F 

70 


B 


H 

104 

B 

* 

159 

B 

* 

30 

m 

* 


B 

* 

95 

48 


48 

>k 

B 

sk 


sk 

G 

85 

* 



115 

B 


a 

70 

00 

sk 



48 

78 

B 

* 

B 

48 


B 

sk 

E 

sk 

sk 

sk 

H 


00 

48 

B 

m 

B 

sk 

sk 

152 

* 

sk 


M 

48 

B 

B 


123 

El 

145 

B 

sk 

B 

sk 


sk 

I 


30 

135 

78 

111 

B 

128 

* 

* 

sk 

m 

136 

95 

188 

161 

85 

5k 

143 

154 

143 

5k 

B 

sk 

118 

sk 

m 

J 

B 

* 

* 

A 

EB 

* 

* 

* 

* 

* 

* 

* 

*k 

* 

30 

* 

A 

5k 

5k 

5k 

30 

5k 

A 

sk 

sk 

sk 

K 

n 

* 

B 

* 

78 

* 

* 

V 

B 

sk 

* 


* 

B 

* 

* 

sk 

sk 

B 

5k 

5k 

sk 

A 

sk 

sk 

sk 

L 

146 

48 

48 

95 

167 

48 

b 

B 


* 

* 

143 

B 

B 

111 

48 

5k 

B 

78 

90 

B 

B 

B 

sk 

B 

sk 

M 

166 

78 

48 

b 

141 

B 

sk 

B 

95 

* 

* 

* 

111 

* 

B 

B 

A 


B 



5k 

sk 

sk 

B 

sk 

N 

141 

30 

128 

172 

176 

95 

43 

60 

148 

w 

I 

70 


90 

126 

48 



138 

191 

85 

48 

48 

sk 

B 

sk 

0 . 

85 

60 

Jgj 

B 

48 

B 

B 

48 

70 


I 

128 

gg 

189 

78 

B 

sk 

181 

115 

128 

157 

85 

90 

lift 

■■ 

B 

sk 

P 1 

115 

00 

B 

B 

136 


* 

48 

78 

* 

* 

111 

B 

B 

123 


A 

126 

78 

B 

48 

B 

Ejl 

sk 

B 

sk 

Q 

* 

* 

* 

»k 

* 

* 

* 

*k 

* 

* 

* 

* 

B 

sk 

* 

5k 

sk 


sk 

sk 

118 

sk 

sk 

>k 

* 

sk 

R 

159 

m 

95 

123 

199 

78 

85 

48 

148 

B 

00 


95 

85 

145 

m 

»k 


149 

162 

B 

m 

60 

sk 

95 

*k 

S 

138 

48 

BS 

70 

169 

108 

B 

142 

153 

* 

00 

B 

48 

B 

118 

B 

5k 

B 

128 

B 

B 

B 

B 

sk 

B 

sk 

T 

145 

48 

78 

78 

185 

85 


189 

165 

* 

* 

B 

78 

85 


30 

Eft 

123 

128 

128 


5k 

166 

sk 

161 

E0 

U 

70 

48 

48 

48 

m 


B 

* 

B 

* 

5k 

78 

70 

132 


30 

»k 

149 

B 

108 

5k 

B 

*k 

sk 

sk 


V 

78 

* 

* 

* 

176 

* 

* 

* 

m 

* 

* 

* 

sk 

sk 

B 

* 

5k 

5k 

sk 

B 

5k 

sk 

sk 

sk 

sk 

sk 

w 

m 

* 

* 

* 

134 

* 

* 

B 

111 

* 

sk 

B 

sk 

B 

128 

5k 

5k 

B 

B 

sk 

A 

A 

sk 

sk 

B 

sk 

X 

E 

* 

|Q 

B 

B 

B 

* 

B 

m 

* 

* 

* 

* 

B 


m 

5k 

B 

B 

85 

5k 

sk 

sk 

sk 

sk 

sk 

Y 


m 

60 


95 

B 

B 

B 

48 

* 

* 

B 

B 

78 

B 

48 

* 

B 


118 


sk 

B 

sk 

sk 

sk 

Z 

00 

* 

* 

* 

|Q 


>1 

5fS 

00 


sk 

* 

* 

* 



5k 

5< 

5k 

5k 

* 


* 

* 

* 

sk 


*In computations, assign a value of-100 as the log for these digraphs These combinations do not usually 
occur in 6,000 digraphs Do not assign “0” to these combinations as that is the logarithmic value for a fre¬ 
quency of one, and these combinations have a frequency of less than one 
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Table 15 —Relative logarithmic values ( Log „ 222) of frequencies of English digraphs * 


BCD 


[Baaed on n count of 6,000 digraphs) 

Second Lexter 

GHIJKLMNOPQRSTUVWXYZ 



♦See pages 11-12 for details 
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SPECIAL-PURPOSE DATA 


Table l6-A .—Frequency distribution of digraphs, based on 64,3^5 
letters of decrypted U. S, Government messages m which Z was used as 
a word-sepai ator and X was used for Loth Xp and Zp. 

X** L+r 

ABCDEFGHIJKLMNOPQRSTUVWXY 


Z 



In the text which gave rise to this and the following two tables, the 
frequently-used punctuation signs "coma" and "period" were eobreviated 
as CMA and PD, respectively, and the procedure term "repeat" was abbre¬ 
viated as RTTj thus, the digraphs CI1, PD, PT, and RP, which usually do 
not occur frequently (see Table 6-A), are of relatively high frequency 
here. 
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Table Ii3-B ■.--Frequency clLstrlbution of digraphs, baicd on the 
text used for Table l6-A, from which the Z wore] -separator has been 
omt'oted (total* 53*866 letters)* 

!♦* L+v- 



A 

B 

C 

D 

E 

F 

G 

II 

I 

j 

K 

L 

M 

N 

0 

P 

Q 

R 

s 

T 

U 

V 

w 

X 

X 

z 

A 

n 

ns 

(40 

(64 

40 

136 

III 

2C 

m 

14 

S2 

217 

166 

434 

S7 

147 

3 

C$7 

614 

31S 

45* 

58 

4o 

23 

67 


B 

« 

»4 

4 

2 

(43 

5 


i 

43 

3Z 


114 

4 

IS 

41 

z 


42 

11 

13 

IS 

2 

3 

*5 

4o 


C 

133 

1 

31 

20 

243 

32 

14 

ISt 

114 


47 

47 

343 

11 

416 

? 

Z 

78 

74 

180 

41 

1 

4 

4 

27 


D 

443 

C6 

101 

7+ 

*507 

76 

24 

13 

173 

7 

S’ 

23 

31 

22 

(SI 

47 

14 

142 

US 

(S3 

54 

+0 

ss 

z 

Is 


E 

114 

10 

371 

471 

610 

173 

43 

37 

324 

21 

|Z 

201 

no 

8ss 

121 

21S 

43 

431 

476 

367 

S3 

IIS 

87 

(3* 

34 


F 

CO 

n 

41 

IS 

104 

m 

1 

z 

380 

3 

1 

34 

is 

10 

sai 

80 

1 

148 

56 

67 

1-4 

3 

1 

3 

7 


G 

(02 

34 

2-0 

SI 

244 

(4 

32 

242 

37 

S 

z 

11 

(0 

41 

4S 

38 

4 

11 

53 

38 

31 

2 

3 

*7 

1 


H 

170 

7 

34 

18 

ns 

S4 

13 

31 

210 

14* 

II 

8 

13 

34 

134 

It 

23 

134 

64 

3IS 

18 

3 

It 

s 

3 


I 

Qj 

H 

E9 

□ 

03 

H 

330 

8 

S’ 

i 



yj 

70S 

20 1 

Ef!!l 

1 

B3 

0 

21? 

1 

170 

3 

0 

B 


J 

33 

a 

7 

Z 

31 

Q 


1 

24 

20 

B 

□ 

B 

5 


B 

B 

4 

2 

11 

1 

2 

z 


B 


K 

a 


7 

□ 

109 

B 

2 


5*4 

3 


B 

B 

13 

B 

D 

1 

6 

11 

S' 

2. 

1 

4 

0 

□ 


L 

117 


38 

61 

m 

47 

2 

13 

2o7 

7 

D 


H 

67 

03 

14 


11 

B 

So 

ffl 

8 

14 

1 



M 

54S 

03 

66 

IS 

2jOL 

22 

04* 

4 

46 

4 

i 

6 

n 

B 

Q 

U3 

3 

24 

B 

4o 

22 

1 

(0 

is 

n 


1 

113 

27 

nt 

336 

748 

134 

1S4 

12 

U3 

4 

ii 

□ 

47 

m 

0 

11 

It 

66 

20 

3S2 

75 

23 

18 

18 


B 

0 

« 

B 

1<U 

ISS 

43 

416 

12 

47 

37 

13 

17 

171 

2S1 

410 

14 

111 

z 

473 

20t 

114 

411 

SI 

67 

170 

17 

i 

P 

3 IK 

7 

14 


170 

5 

3 

40 

14 

4 


76 

46 

II 

ISO 

HI 

3 

174 

B 

36S 

14 

z 

1 

1 

5 


Q 

14 

4 

3 


7 

2 


4 

5 


1 

z 

S 

II 

s 

1 

i 

to 

to 

1 

117 

B 

3 

1 



R 

33 

u 

131 

•46 

loll 

84 

tc 

(4 

107 

17 

40 

64 

141 

54 

631 

341 

8 

103 

166 

163 

67 

□ 

14 

3o 

7+ 


S 

237 

37 

143 

11 

316 

(44 

ss 

4S3 

361 

S 

11 

25* 

4& 

36 

173 

I2<1 

14 

41 

178 

a8S 

»44 

14 

34 

z 

43 


T 

177 

30 

l« 

7o 

436 

47 

21 

S42 

307 

(4_ 

16 

43 

67 

loo 

463 

4S 

B 

B 

0 

172 


12 

33g 

3o 

57 


U 

Q] 

48 

33 

4| 

88 

7 

Cl 

z 

1 .... 

36 

4 

4r 

S6 

61 

47 

iS 

46 


4S4 

50 

0 


s 

6 

6 

1 


V 



1 IS 

S 

414 

1 

1 


^Z 


z 

4 

3 

8 

2-1 

4 

B 

B 

B 

B 

B 

1 

1 

1 

7 


1 

B 

D 

B 

4 

31 

2 

12 

3S 

107 

3 

1 

10 

1 

14 

367 

to 

z 

3 

H 

6 

1 


13 

13 

4 


X 1 

1? 

B 

B 

□ 

361 

Zo 


4 

12. 


Id 

Z 

4 

II 

24 

41 

3 

24 

14 

47 

4 

ft 


S4 

3 


a 

□ 

B 

B 

ED 

•4 

33 

IS 

s 

zZ 

1 

i 

7 

21 

IS 

74 

11 

1 

31 

34 

38 

10 

1 

18 


la 

1 

z 

_ 
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Tabic l6-C .—The 53 digraphs from Table 6-A which comprise 50$ of 
the total, arranged according to frequencies reduced to a base of 5,000 
digraphs, shown with the corresponding frequencies of the same digraphs 
from Table l6-B (also reduced bo a base of 5,000) • 1 


Dig. 

6 -A 

16-B 

Dig, 

6 —A 

16 -B 

EN 

111 

79 

FO 

40 

54 

BE 

98 

94 

FI 

39 

35 

ER 

87 

86 

RA 

39 

28 

m 

82 

33 

ET 

37 

34 

TH 

78 

55 

LE 

37 

31 

ON 

77 

84 

OU 

37 

39 

IN 

75 

65 

MA 

36 

55 

TE 

71 

36 

TO 

36 

31 

AN 

6 h 

4l 

EA 

35 

28 

OR 

64 

44 

IS 

35 

28 

ST 

63 

36 

SI 

34 

34 

ED 

60 

45 

DE 

33 

29 , 

BE 

57 

69 

HI 

33 

20 1 

VE 

57 

46 

AL 

32 

21 1 

ES 

54 

44 

CE 

32 

24 

ND 

52 

31 

DA 

32 

41 

TO 

50 

43 

EC 

32 

36 

SE 

49 

37 

RS 

31 

25 

AT 

47 

37 

TJR 

31 

42 

TI 

45 

29 

NI 

30 

24 

AR 

44 

61 

HI 

30 

19 J- 

EE 

42 

64 

EL 

29 

19 1 

HE 

42 

24 

HE 

28 

29 , 

AS 

4i 

57 

LA 

28 

18 1 

CO 

4i 

39, 

RO 

28 

59 

10 

4l 

19 I 

TA 

28 

26 

TY 

4l 

5 1 





With the exception of AL, EL, HI, 10, LA., HI, TY, the digraphs of 
this table are all from among the 65 digraphs from Table 16 -B which 
comprise 50 $ of the total. 
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APPENDIX 3 

WORD AND PATTERN LISTS - ENGLISH | 

Section Pages 

A. List of words used In military text arranged alphabeti¬ 

cally according to word length... 2-10 

B. List of words used in military text arranged in rhyming 

order according to word length....... 11-19 

C. List of words used in military text arranged alphabeti¬ 

cally according to word pattern...20-37 

D* Digraphic idiomorphs: general. 38-39 

E. Digraphic idiomorphs: Playfair.....40-42 

F. Digraphic idiomorphs: four-square.. 43-45 
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A. LIST OF WORDS USED IN MILITARY TEXT ARRANGED ALPHABETICALLY 

ACCORDING TO WORD LENGTH 


TWO LETTER WORDS 


AM 

BY 

EM 

IN 

MM 

OK 

TO 

AN 

CO 

GO 

IS 

MP 

ON 

US 

AS 

CP 

HE 

IT 

MY 

OR 

WD 

AT 

CQ 

HQ 

MC 

NO 

QM 

WE 

BE 

BN 

DO 

IF 

ME 

OF 

SO 

WO 


THREE LETTER WORDS 


ACT 

BID 

DUN 

HAS 

MIX 

PVT 

TEN 

ADD 

BIG 

EAT 

HER 

NAN 

QMC 

THE 

ADJ 

BOX 

END 

HIM 

NET 

RED 

TIN 

AGE 

BUT 

EYE 

HIS 

NEW 

RID 

TON 

AGO 

BUY 

FAR 

HOW 

NOT 

ROB 

TOO 

AID 

CAM 

FEW 

ILL 

NOW 

RUN 

TOP 

AIM 

CAN 

FIT 

ITS 

OFF 

SAW 

TRY 

AIR 

CAR 

FIX 

JIG 

OLD 

SAY 

TUB 

ALL 

CAV 

FOR 

JOB 

ONE 

SEA 

TWO 

AND 

COL 

FOX 

KEG 

OUR 

SEE 

USE 

ANY 

CPL 

GAL 

LAW 

OUT 

SET 

VAT 

APT 

CUT 

GAS 

LAY 

OWE 

SGT 

WAR 

ARC 

CWT 

GEN 

LET 

OWN 

SHE 

WAS 

ARE 

DAY 

GET 

LOT 

PAR 

SIX 

WAY 

ARM 

DID 

GHQ 

LOW 

PAY 

SPY 

WET 

ASK 

DIE 

GOT 

MAJ 

PEN 

SUM 

WGT 

BAD 

DOG 

GUN 

MAN 

PER 

SUN 

WON 

BAG 

DRY 

HAP 

MAT 

PIN 

TAN 

YET 

BAR 

DUE 

HAM 

~ MEN 

PUT 

TAX 

YOU 


FOUR LETTER WORDS 


ABLE 

BOTH 

EACH 

FLEE 

HIGH 

LATE 

MAIN 

AIDE 

BULB 

EAST 

FORM 

HILL 

LEAD 

MANY 

ALLY 

BULK 

EASY 

FOUR 

HITS 

LEAK 

MASK 

ALSO 

CALL 

EDGE 

FROM 

HOLD 

LEFT 

MASS 

AREA 

CELL 

EYES 

FULL 

HOOK 

LESS 

MEAT 

ARMY 

CITY 

FALL 

FUSE 

INTO 

LIEU 

MEET 

ASIA 

CODE 

FARM 

FUZE 

ITEM 

LINE 

MESS 

AWAY 

COOK 

FAST 

GUNS 

JOIN 

LIST 

MIKE 

AXIS 

DARK 

FEEL 

HALF 

JULY 

LOAD 

MILE 

BACK 

DASH 

FEET 

HALT 

JUNE 

LONG 

MINE 

BASE 

DATE 

FELL 

HAND 

JUST 

LOOK 

MORE 

BEEN 

DAYS 

FILE 

HARD 

KEEP 

LOSS 

MOVE 

BLUE 

DIRT 

FIRE 

HAVE 

KIND 

LOST 

MTCL 

BODY 

DOWN 

FIRM 

HEAD 

KING 

LOVE 

MULE 

BOMB 

DRAW 

FIVE 

HERD 

LAND 

MADE 

NAVY 

BOOK 

DUMP 

FLAG 

HERE 

LAST 

MAIM 

NEAR 
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FOUR LETTER WORDS—Continued 


NEXT 

PARK 

REAR 

SHOT 

TEAM 

TOOK 


WEST 

NINE 

PASS 

RIOT 

SIDE 

TENT 

TOOL 


WHAT 

NOON 

PIPE ’ 

ROAD 

SOME 

TEXT 

TOWN 


WHEN 

NOTE 

PLAN 

ROUT 

SOON 

THAN “ “ 

type 


WILL 

OBOE 

POST 

RULE 

STOP 

THAT 

UNIT 


WIRE 

OMIT 

PUMP 

RUSH 

SUNK 

THEM 

VARY 


WITH 

ONCE 

Bush 

SAID 

TAKE 

THEN 

•VERY 


XRAY 

ONLY 

RAID 

SAME 

TALK 

THEY 

WEAK 


YOKE 

OPEN 

RAIL 

SANK 

TANK 

THIS 

WEEK 

- 

ZERO 

ORAL 

RAIN 

SEEN 

TARE 

TIME 

WELL 


ZONE 

OVER 

RANK 

SHIP 

TASK 

TONS 

WERE 





ABOUT 

BOATS 

DECKS 

FIVE LETTER WORDS 

FIGHT LATER 

PRIOR 

SHIPS 

TITLE 

AFTER 

BOMBS 

DEFER 

FIRES 

LEAST 

PROOF 

SHORE 

TODAY 

AGAIN 

BOOTH 

DELAY 

FIRST 

LEAVE 

PROVE 

SIEGE 

TOTAL 

AGENT 

BREAK 

DEPOT 

FLANK 

LEVEL 

QUEEN 

SIGHT 

TRACT 

ALARM 

BRIBE 

DEPTH 

FLARE 

LIGHT 

QUICK 

SIXTH 

TRAIN 

ALERT 

BROKE 

DOCKS 

FLATS 

LIMIT 

QUIET 

SIXTY 

TROOP 

ALIGN 

BURST 

DRAWN 

FLEET 

LOCAL 

RADIO 

SLOPE 

TRUCE 

ALINE 

CANAL 

DRESS 

FOGGY 

MAJOR 

RAFTS 

SMALL 

TRUCK 

ALLOW 

CASES 

DRILL 

FORCE 

MARCH 

RAIDS 

SMOKE 

UNDER 

ALONG 

CAUSE 

DRIVE 

FORTY 

METER 

RALLY 

SOUTH 

UNION 

AMONG 

CEASE 

EAGER 

Fresh 

MILES 

RANGE 

SPEED 

UNITS 

ANNEX 

CHECK 

EARLY 

FRONT 

MOTOR 

RAPID 

SPELL 

USUAL 

APPLY 

CHIEF 

EIGHT 

GATES 

NAVAL 

REACH 

SPLIT 

VALOR 

APRIL 

CLEAR 

ENEMY 

GAUGE 

NIGHT 

READY 

SQUAD 

VISIT 

AREAS 

CLERK 

ENTER 

GIVEN 

NINTH 

REFER 

STAFF 

VITAL 

ARMOR 

CLOSE 

EQUAL 

GOING 

NORTH 

REPEL 

STAKE 

VOCAL 

ASSET 

COAST 

EQUIP 

GROUP 

ORDER 

RIDGE 

START 

VOICE 

AWAIT 

COLON 

ERASE 

GUARD 

OTHER 

RIGHT 

X STEEL 

WAGON 

AWARD 

COMMA 

ERROR 

GUEST 

PACKS 

RIGID 

SUGAR 

WEIGH 

BAKER 

CORPS 

ETHER 

HEAVY 

PAIRS 

RIVER 

TAKEN 

WHEEL 

BANKS 

COUNT 

EVERY 

HONOR 

PARTY 

ROGER 

TANKS 

WHERE 

BARGE 

COVER 

FATAL 

Horse 

PETER 

ROUTE 

TENTH 

WHICH 

BEACH 

CREEK 

FEARS 

HOURS 

PLACE 

SCALE 

THEIR 

WIDTH 

BEGIN 

CREST 

FERRY 

HOUSE 

PLAIN 

SEIZE 

THERE 

WIPED 

BEING 

CROSS 

FIELD 

ISSUE 

PLANS 

SEVEN 

THESE 

WOODS 

BLACK 

CURVE 

FIFTH 

JAPAN 

POINT 

SHELL 

THIRD 

YARDS 

BLIND 

DAILY 

FIFTY 

LARGE 

PRESS 

SHIFT 

THREE 

ZEBRA 
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SIX LETTER WORDS 


ACCEPT 

BOMBED 

DEGREE 

FIERCE 

LESSON 

OTHERS 

RESUME 

SUFFER 

ACCESS 

BOMBER 

DEPART 

FILING 

LETTER 

OUTPUT 

RETIRE 

SUMMER 

ACROSS 

BOTTOM 

DEPEND 

FINISH 

LINING 

PANAMA 

RETURN 

SUMMIT 

ACTION 

BRANCH 

DEPLOY 

FIRING 

LIQUID 

PARADE 

REVIEW 

SUMMON 

ACTIVE 

BREACH 

DESERT 

FLIGHT 

LITTER 

PARLEY 

RIDING 

SUNDAY 

ADJUST 

BREEZE 

DETACH 

FLYING 

LITTLE 

PASSED 

ROCKET 

SUNKEN 

ADVICE 

BRIDGE 

DETAIL 

FOLLOW 

LOCATE 

PASSES 

ROUTED 

SUNSET 

ADVISE 

BROKEN 

DEVICE 

FORCES 

LOSSES 

PATROL 

ROUTES 

SUPPLY 

AFFAIR 

BUREAU 

DEVISE 

FORMAL 

MANAGE 

PERIOD 

RUBBER 

SURVEY 

ALASKA 

CANADA 

DIRECT 

FORMrD 

MANNER 

PICKET 

RUNNER 

SWITCH 

ALLEGE 

CANCEL 

DIVERT 

FOUGHT 

MANUAL 

PINCER 

SALARY 

SYSTEM 

ALLIED 

CANNOT 

DIVIDE 

FOURTH 

MEAGER 

PISTOL 

SCHEME 

TABLES 

ALLIES 

CANVAS 

DOCTOR 

FRIDAY 

MEDIUM 

PLACES 

SCHOOL 

TANKER 

ALWAYS 

CASUAL 

DOLLAR 

FUTURE 

MEMBER 

PLANES 

SCORED 

TARGET 

ANIMAL 

CAUSED 

DOWNED 

GARAGE 

METHOD 

POINTS 

SCREEN 

TATTOO 

ANNUAL 

CENTER 

DRYRUN 

GEORGE 

METRIC 

POISON 

SEAMAN 

TERROR 

ANYWAY 

CHANGE 

DUGOUT 

GREASE 

MINING 

POLICE 

SEAMEN 

THIRTY 

APPEAR 

CHARGE 

DURING 

GROUND 

MINUTE 

PONTON 

SEARCH 

JifOUGH 

ARABIA 

CHEESE 

EFFECT 

GUNNER 

MIRROR 

POSTAL 

SECOND 

’”1 'REAT 

ARMIES 

CHURCH 

EFFORT 

HALTED 

MOBILE 

PREFER 

SECTOR 

'I RAINS 

ARMORY 

CIPHER 

EIGHTH 

HAMMER 

MONDAY 

PROMPT 

SECURE 

TRENCH 

ARREST 

CIRCLE 

EIGHTY 

HAPPEN 

MORALE 

PPOPER 

SELECT 

TROOPS 

ARRIVE 

COFFEE 

EITHER 

HARBOR 

MORTAR 

PURSUE 

SERIAL 

TURRET 

ASSETS 

COLORS 

ELEVEN 

HELPER 

MOVING 

RADIAL 

SETTLE 

TWELVE 

ASSIST 

COLUMN 

EMBARK 

HIGHER 

MURDER 

RAIDED 

SEVERE 

TWENTY 

ASSURE 

COMBAT 

EMPLOY 

HOURLY 

MUZZLE 

RATION 

SHELLS 

UNABLE 

ATTACH 

COMMIT 

ENCODE 

INDEED 

NAUGHT 

RAVINE 

SIGCOM 

UNITED 

ATTACK 

COMMON 

ENGAGE 

INFORM 

NEARER 

RECORD 

SIGNAL 

UNLESS 

ATTAIN 

CONVEY 

ENGINE 

INLAND 

NINETY 

REDUCE 

SINGLE 

VALLEY 

AUGUST 

CONVOY 

ENROLL 

INTEND 

NORMAL 

REFILL 

SLIGHT 

VERBAL 

BANNER 

COURSE 

ENTIRE 

INTENT 

NOTING 

REFUGE 

SPHERE 

VERIFY 

BARBED 

CREDIT 

ERASER 

INVENT 

NOUGHT 

REFUSE 

SPOOLS 

VESSEL 

BARGES 

CRISIS 

ESCORT 

ISLAND 

NOVICE 

REJECT 

SPOONS 

VICTIM 

BATTEN 

CRITIC 

EUROPE 

ISSUES 

NOZZLE 

RELIEF 

STATES 

VICTOR 

BATTLE 

DAMAGE 

EXCEPT 

KEEPER 

NUMBER 

REMAIN 

STATUS 

VISITS 

BEETLE 

DEBARK 

EXCESS 

KILLED 

OCCUPY 

REMEDY 

STRAFE 

VISUAL 

BEFORE 

DECIDE 

EXCITE 

LADDER 

OFFEND 

REPAIR 

STREET 

WEIGHT 

BETTER 

DECODE 

EXPECT 

LANDED 

OFFICE 

REPORT 

STRESS 

WIRING 

BEYOND 

DECREE 

EXPELS 

LAUNCH 

OPPOSE 

RESCUE 

STRIPS 

WITHIN 

BILLET 

DEFEAT 

EXPEND 

LEADER 

ORDERS 

RESIST 

SUBMIT 

WOODED 

BITTER 

DEFECT 

EXTEND 

LEAGUE 

ORIENT 

RESULT 

SUDDEN 

ZIGZAG 

BODIES 

DEFEND 

EXTENT 








SFVEN LETTER WORDS 




ABANDON 

ALMANAC 

APPOINT 

ASIATIC AVIATOR 

BATTERY 

BETWEEN 

ABSENCE 

AMMETER 

APPROVE 

ASSAULT AWKWARD 

BATTLES 

BICYCLE 

ADDRESS 

ANALYZE 

ARMORED 

ATTACKS BAGGAGE 

BEARING 

BINDING 

ADVANCE 

ANOTHER 

ARRANGE 

ATTEMPT BALLOON 

BECAUSE 

BIVOUAC 

AGAINST 

ANTENNA 

ARRIVAL 

AVERAGE BARRAGE 

BEDDING 

BOMBARD 
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SEVEN LETTER WORDS—Continued 


BOMBERS 

DEBOUCH 

FITTING 

LANDING 

PACKAGE 

REQUEST 

SUPPOSE 

BOMBING 

DECIDED 

FOGHORN 

LEADING 

PASSAGE 

REQUIRE 

SURPLUS 

BOYCOTT 

DECLARE 

FORCING 

LECTURE 

PASSIVE 

RESERVE 

SUSPEND 

BRIBERY 

DECODED 

FORGING 

LIAISON 

PATROLS 

RESPECT 

TACTICS 

BRIGADE 

DEFENSE 

FORWARD 

IIBRARY 

PAYROLL 

RESPOND 

TALKING 

CALIBER 

DELAYED 

FOXHOLE 

LICENSE 

PLACING 

RFTIRED 

TARGETS 

CALIBRE 

DELIVER 

FUELOIL 

LIFTING 

PLATdbN 

RETREAT 

TERRAIN 

CAPTAIN 

DERRICK 

FURNISH 

LbADlNG 

POUNDER 

REVENUE 

THATTHE 

CAPTIVE 

DESTROY 

FURTHER 

LOGICAL 

PRAIRIE 

REVERSE 

THROUGH 

CARRIER 

DETRAIN 

GASSING 

LOOKOUT 

PRECEDE 

REVOLVE 

TOBACCO 

CAVALRY 

DETRUCK 

GENERAL 

MACHINE 

PREPARE 

ROUTINE 

TONIGHT 

CENTRAL 

DEVELOP 

GETTING 

MANDATE 

PRESENT 

RUNNING 

TONNAGE 

CHANGES 

DIAGRAM 

GLASSES 

MANNING 

PRESSED 

SAILORS 

TORPEDO 

CHANNEL 

DISCUSS 

GRADUAL 

MAPPING 

PRIMARY 

SATISFY 

TRACTOR 

CHARLIE 

DISEASE 

GRENADE 

MARCHED 

PROCEED 

SECRECY 

TRAFFIC 

CHASSIS 

DISMISS 

GUARDED 

MARSHAL 

PROGRAM 

SECTION 

TRAWLER 

CIRCUIT 

DISTILL 

HALTING 

MARTIAL 

PROMOTE 

SECTORS 

TRIGGER 

CLIPPER 

DROPPED 

HASBEEN 

MAXIMUM 

PROPOSE 

SERVICE 

TUESDAY 

COASTAL 

EASTERN 

HEADING 

MEDICAL 

PROTECT 

SESSION 

TWELFTH 

COLLECT 

ECHELON 

HEAVIER 

MESSAGE 

PROTEST 

SETBACK 

UNKNOWN 

COLLEGE 

ELEMENT 

HIGHEST 

MESSING 

PROVOST 

SEVENTH 

UNUSUAL 

COLONEL 

ELEVATE 

HOLDING 

MILITIA 

PURPOSE 

SEVENTY 

USELESS 

COMMAND 

EMBASSY 

HORIZON 

MINIMUM 

PURSUIT 

SEVERAL 

UTILITY 

COMMEND 

ENCODED 

HOSTILE 

MISFIRE 

PUSHING 

SHELLED 

VACANCY 

COMMENT 

ENEMIES 

HUNDRED 

MISSING 

QUARTER 

SHORTLY 

VARYING 

COMMUTE 

ENFORCE 

ICEBERG 

MISSION 

QUICKLY 

SIGNIFY 

VESSELS 

COMPANY 

ENGAGED 

ILLEGAL 

MORNING 

RADIATE 

SIMILAR 

VICTORY 

COMPASS 

ENTENTE 

ILLNESS 

NATURAL 

RAIDING 

SIMPLEX 

VILLAGE 

CONCEAL 

ENTRAIN 

INCLUDE 

NEAREST 

RAILWAY 

SINKING 

VISIBLE 

CONDEMN 

ENTRUCK 

INFLICT 

NIGHTLY 

RAINING 

SIXTEEN 

VISITOR 

CONDUCT 

ENVELOP 

INITIAL 

NOTHING 

RAPIDLY 

SLOPING 

WARFARE 

CONFINE 

EXCLUDE 

INQUIRE 

NUMBERS 

REACHED 

SMOKING 

WARSHIP 

CONTACT 

EXPLAIN 

INQUIRY 

OBSERVE 

RECEIPT 

SOLDIER 

WEATHER 

CONTAIN 

EXPRESS 

INSPIRE 

OCTOBER 

RECEIVE 

STARTER 

WESTERN 

CONTROL 

EXTRACT 

INSTALL 

OFFENSE 

RECOVER 

STATION 

WHETHER 

CORRECT 

EXTREME 

INSTANT 

OFFICER 

RECRUIT 

STEAMER 

WILLIAM 

COUNCIL 

FALLING 

INVADED 

OMITTED 

REDUCED 

STOPPED 

WINDAGE 

COURIER 

FARTHER 

ISLANDS 

OPERATE 

REFUGEE 

STORAGE 

WITHOUT 

COVERED 

FEDERAL 

ISSUING 

OPINION 

REGULAR 

SUCCESS 

WITHTHE 

CROSSED 

FIFTEEN 

JANUARY 

ORDERED 

RELEASE 

SUGGEST 

WITNESS 

CRUISER 

FIGHTER 

JUMPOFF 

OUTPOST 

RELIEVE 

SUMMARY 

WOUNDED 

CURRENT 

FILLING 

KITCHEN 

OUTSIDE 

JREPAIRS 

SUNRISE 

WRECKED 

CYCLONE 

FINDING 

KILLING 

PACIFIC 

REPLACE 

SUPPORT 

WRITTEN 

DAMAGED 

FISHING 








EIGHT 

LETTER WORDS 



ACTIVITY 

ADVANCED 

AIRBORNE 

AIRPLANE 

ANNOUNCE 

APPROACH 

ASSEMBLE 

ACTUALLY 

ADVANCES 

AIRCRAFT 

ALTITUDE 

ANTITANK 

APPROVAL 

ASSEMBLY 

Adjacent 

ADVISING 

AIRDROME 

AMERICAN 

APPARENT 

ARMAMENT 

ASSIGNED 

ADJUTANT 

ADVISORY 

AIRFIELD 

ANALYSIS 

APPEARED 

ARRESTED 

ASSOONAS 


RESTRICTED 
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EIGHT LETTER WORDS—Continued 


ATLANTIC 

ATTACKED 

ATTEMPTS 

AVIATION 

BARRACKS 

BARRAGES 

BATTERED 

BATTLING 

BESEIGED 

BILLETED 

BOUNDARY 

BREAKING 

BUILDING 

BULLETIN 

BUSINESS 

CALAMITY 

CAMPAIGN 

CANISTER 

CAPACITY 

CAPTURED 

CARELESS 

CARRIAGE 

CARRIERS 

CARRYING 

CASUALTY 

CAUSEWAY 

CEMETERY 

CENTERED 

CHAPLAIN 

CHEMICAL 

CIRCULAR 

CITATION 

CIVILIAN 

CLERICAL 

CODEBOOK 

COMMANDS 

COMMENCE 

COMMERCE 

COMPLETE 

COMPOSED 

CONCLUDE 

CONCRETE 

CONFLICT 

CONGRESS 

CONTINUE 

CONTRACT 

CORPORAL 

CORRIDOR 

COVERING 

CRITICAL 


CRITIQUE 

CROSSING 

CRUISERS 

DAMAGING 

DARKNESS 

DAYLIGHT 

DECEMBER 

DECIPHER 

DECISION 

DECISIVE 

DECLARED 

DECREASE 

DEDICATE 

DEFEATED 

DEFENDED 

DEFENDER 

DEFENSES 

DEFERRED 

DEFINITE 

DELAYING 

DEMANDED 

DEPARTED 

DEPLOYED 

DEPORTED 

DESCRIBE 

DESERTED 

DESERTER 

DESPATCH 

DETACHED 

DETECTOR 

DETONATE 

DEVELOPE 

DICTATED 

DICTATOR 

DIMINISH 

DIRECTOR 

DISARMED 

DISASTER 

DISLODGE 

DISPATCH 

DISPERSE 

DISTANCE 

DISTRESS 

DISTRICT 

DIVIDING 

DIVISION 

DOCTRINE 

DOMINANT 

DRESSING 


DRIFTING 

EASTERLY 

EASTWARD 

ECONOMIC 

EFFECTED 

EFFICACY 

EIGHTEEN 

ELEMENTS 

ELEVENTH 

ELIGIBLE 

EMPLOYEE 

EMPLOYER 

ENCIPHER 

ENCIRCLE 

ENFILADE 

ENGAGING 

ENGINEER 

ENLISTED 

ENORMOUS 

ENROLLED 

ENTERING 

ENTRENCH 

ENVELOPE 

EQUALIZE 

EQUIPAGE 

ESCORTED 

ESTIMATE 

EUROPEAN 

EVACUATE 

EXCAVATE 

EXCHANGE 

EXERCISE 

EXPANDED 

EXPEDITE 

EXPELLED 

EXPENDED 

EXPENSES 

EXTENDED 

EXTERIOR 

FACTIONS 

FATALITY 

FEBRUARY 

FERRYING 

FIGHTERS 

FIGHTING 

FINISHED 

FLANKING 

FLEXIBLE 

FOOTHOLD 


FORENOON 

FORTRESS 

FOURTEEN 

FRONTAGE 

FUSELAGE 

GARRISON 

GROUNDED 

GROUPING 

GUARDING 

HAVEBEEN 

HINDERED 

HOSPITAL 

HOWITZER 

IDENTIFY 

IGNITION 

IMPROPER 

IMPROVED 

INCIDENT 

INDICATE 

INDIRECT 

INFANTRY 

INFECTED 

INITIATE 

INSECURE 

INSIGNIA 

INSTRUCT 

INTEREST 

INTERIOR 

INTERNAL 

INTRENCH 

INVADING 

INVASION 

INVENTED 

JETPLANE 

JUNCTION 

LANGUAGE 

LATITUDE 

LETTERED 

LIMITING 

LOCATION 

LUMINOUS 

MAINTAIN 

MANDATED 

MANEUVER 

MARCHING 

MARITIME 

MATERIAL 

MATERIEL 

MECHANIC 


MEDICINE 

MEMORIAL 

MERCIFUL 

MESSAGES 

MIDNIGHT 

MILITARY 

MISFIRES 

MISSIONS 

MOBILIZE 

MONOPOLY 

MOUNTAIN 

MOVEMENT 

NATIONAL 

NAUTICAL 

NINETEEN 

[NORTHERN 

NOVEMBER 

[observed 

OBSERVER 

[OBSOLETE 

.OBSTACLE 

OCCUPIED 

OFFENDED 

OFFICERS 

OFFICIAL 

OPERATOR 

OPPOSING 

OPPOSITE 

ORDINATE 

ORDNANCE 

OUTBOARD 

OUTGUARD 

6UTPOSTS 

PAINTING 

PARALLAX 

PARALLEL 

PASSPORT 

PLANNING 

POLITICS 

PONTOONS 

POSITION 

POSITIVE 

PpSSlBLE 

POSTPONE 

PREPARED 

PkESERVE 

PRESSING 

PRESSURE 

PRINTING 

I 


PRIORITY 

PRISONER 

PROBABLE 

PROBABLY 

PROGRESS 

PROHIBIT 

PROTESTS 

PROTOCOL 

PURPOSES 

QUARTERS 

RAILHEAD 

RAILROAD 

RALLYING 

RECEIVER 

RECORDER 

REDCROSS 

REENLIST 

REGIMENT 

REGISTER 

REJECTED 

REJECTOR 

REMEDIES 

REMEMBER 

REPAIRED 

REPEATER 

REPELLED 

REPLACED 

REPORTED 

REPULSED 

REQUIRED 

RESEARCH 

RESERVES 

RESPECTS 

RESTORED 

RETIRING 

RETURNED 

REVIEWED 

REVOLVER 

RIGOROUS 

SABOTAGE 

SANITARY 

SATURDAY 

SCHEDULE 

SEABORNE 

SEALEVEL 

SELECTED 

SENTENCE 

SENTINEL 

SEPARATE 


SERGEANT 

SHELLING 

SHIPPING 

SIGHTING 

SKIRMISH 

SOLDIERS 

SOUTHERN 

SPECIFIC 

SPOTTING 

SQUADRON 

STANDARD 

STATIONS 

STRATEGY 

SUFFERED 

SUITABLE 

SUPERIOR 

SUPPLIES 

SURPRISE 

SURROUND 

SURVIVED 

SUSPENSE 

SWEEPING 

SWIMMING 

TACTICAL 

TAXATION 

TELEGRAM 

TERRIBLE 

TERRIFIC 

THATHAVE 

THIRTEEN 

THOUSAND 

THURSDAY 

TOMORROW 

TOTALING 

TRAILERS 

TRAINING 

TRANSFER 

TRAVERSE 

TRAWLERS 

VEHICLES 

VICINITY 

VIGOROUS 

WARSHIPS 

WESTERLY 

WESTWARD 

WINDWARD 

WIRELESS 

WITHDRAW 

WITHDREW 
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NINE LE ITER WORDS 


ACCESSORY 

ACCOMPANY 

ACCORDING 

ADDRESSED 

ADDRESSES 

ADMISSION 

ADVANCING 

ADVANTAGE 

AERODROME 

AEROPLANE 

AFTERNOON 

AGREEMENT 

AIRDROMES 

AIRPLANES 

ALLOTMENT 

ALLOWANCE 

ALTERNATE 

AMBULANCE 

AMUSEMENT 

ANNOUNCED 

ANONYMOUS 

APPARATUS 

APPOINTED 

ARBITRARY 

ARTILLERY 

ASCENSION 

ASSAULTED 

ASSISTANT 

ASSOCIATE 

ASSURANCE 

ATTACKING 

ATTEMPTED 

ATTENTION 

AUTOMATIC 

AVAILABLE 

BALLISTIC 

BAROMETER 

BATTALION 

BATTERIES 

BEACHHEAD 

BEGINNING 

BLOCKADED 

BOMBARDED 

BRIGADIER 

BUILDINGS 

CABLEGRAM 

CAMPAIGNS 

CANCELLED 

CARTRIDGE 


CENTERING 
CHALLENGE 
CHARACTER 
CHAUFFEUR 
CHRONICAL 
CIGARETTE 
CIRCULATE 
CIVILIANS 
CLEARANCE 
COALITION 
COLLAPSED 
COLLISION 
COMBATANT 
COMMANDED 
COMMANDER 
COMMITTEE 
COMPANIES 
COMPELLED 
COMPLETED 
CONDEMNED 
CONDENSED 
CONDITION 
CONFERRED 
CONFIDENT 
CONFLICTS 
CONQUERED 
CONTINUAL 
- CONTINUED 
CONTINUES 
COOPERATE 
CORRECTED 
CRITICISE 
CRITICISM 
DEBARKING 
DECREASED 
DEFECTIVE 
DEFENSIVE 
DEFICIENT 
DEPARTURE 
DEPENDENT 
DESCRIBED 
DESIGNATE 
DESTITUTE 
DESTROYED 
DESTROYER 
DETENTION 
DETERMINE 
DETONATED 
DETRAINED 


DEVELOPED 

DIETITIAN 

DIFFERENT 

DIFFICULT 

DIMENSION 

DIRECTION 

DIRIGIBLE 

DISAPPEAR 

DISCUSSED 

DISINFECT 

DISMISSAL 

DISPERSED 

DISTRICTS 

DIVISIONS 

DOMINANCE 

DOMINATED 

ECHELONED 

EFFECTIVE 

EFFICIENT 

ELABORATE 

ELEVATION 

ELSEWHERE 

EMBASSIES 

EMERGENCY 

EMPLOYING 

ENDURANCE 

ENGINEERS 

ENLISTING 

ENTRAINED 

EQUIPMENT 

ESTABLISH 

ESTIMATED 

ESTIMATES 

EXCESSIVE 

EXCLUSION 

EXCLUSIVE 

EXECUTIVE 

EXERCISES 

EXHIBITED 

EXPANSION 

EXPANSIVE 

EXPENSIVE 

EXPLOSION 

EXPLOSIVE 

EXTENDING 

EXTENSION 

EXTENSIVE 

FIFTEENTH 

FIREALARM 


FORMATION 

FORTIFIED 

FRONTLINE 

GROUPMENT 

GYROMETER 

HOSTILITY 

HURRICANE 

IDENTICAL 

IMMEDIATE 

IMPORTANT 

IMPRESSED 

INCENTIVE 

INCIDENCE 

INCIDENTS 

INCLINING 

INCLUDING 

INCLUSIVE 

INCREASED 

INDEMNITY 

INDICATED 

INFLATION 

INFLICTED 

INFLUENCE 

INHABITED 

INSTANTLY 

INTEGRITY 

INTENSIVE 

INTENTION 

INTERCEPT 

INTERDICT 

INTERFERE 

INTERMENT 

INTERPOSE 

INTERRUPT 

INTERVENE 

INTERVIEW 

INVENTION 

IRREGULAR 

KILOMETER 

LAUNCHING 

LIABILITY 

LOGISTICS 

LONGITUDE 

MAINTAINS 

MANGANESE 

MECHANISM 

MEMORANDA 

MESSENGER 

MOTORIZED 


MOVEMENTS 

MUNITIONS 

NAVALBASE 

NECESSARY 

NECESSITY 

NEGLIGENT 

NEWSPAPER 

NORTHEAST 

NORTHERLY 

NORTHWARD 

NORTHWEST 

NUMBERING 

OBJECTION 

OBJECTIVE 

OBTAINING 

OCCUPYING 

OFFENSIVE 

OFFICIALS 

OPERATING 

OPERATION 

OSCILLATE 

OUTSKIRTS 

PARACHUTE 

PARAGRAPH 

PARTITION 

PASSENGER 

PATRIOTIC 

PENETRATE 

PERMANENT 

PERSONNEL 

PLACEMENT 

POLITICAL 

POPULATED 

POSITIONS 

PRACTICAL 

PRECEDING 

PREFERRED 

PREMATURE 

PREPARING 

PRESIDENT 

PRINCIPAL 

PRINCIPLE 

PRISONERS 

PROCEDURE 

PROCEEDED 

PROJECTOR 

PROMOTION 

PROPOSALS 

PROTECTED 
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PROTECTOR 

PROTESTED 

PROVISION 

PROXIMITY 

RADIATION 

RADIOGRAM 

READINESS 

REARGUARD 

REBELLION 

RECEIVING 

RECOGNIZE 

RECOMMEND 

REENFORCE 

REFERENCE 

REFILLING 

REGARDING 

REINFORCE 

REINSTATE 

REMAINDER 

REMAINING 

REPRESENT 

REPRISALS 

REQUESTED 

REQUIRING 

RESOURCES 

RESTRAINT 

RETENTION 

RETURNING 

REVIEWING 

SCREENING 

SEAPLANES 

SECRETARY 

SEMICOLON 

SEMIRIGID 

SEPTEMBER 

SERIOUSLY 

SERVICING 

SEVENTEEN 

SHELLFIRE 

SITUATION 

SIXTEENTH 

SOUTHEAST 

SOUTHWARD 

SOUTHWEST 

SPEARHEAD 

STANDARDS 

STATEMENT 

STRAGGLER 

STRATEGIC 
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SUBMITTED 

SUSPENDED TELEPHONE THEREFORE UNTENABLE 

SUCCEEDED 

SUSPICION TENTATIVE TRANSPORT VARIATION 

SURRENDER 

TECHNICAL TERRITORY TWENTIETH WATERTANK 

SUSPECTED 

TECHNIQUE 





TEN LETTER WORDS 


ACCEPTABLE 

COLLISIONS 

DESPATCHES 

EXPENDABLE 

ACCEPTANCE 

COMMANDANT 

DESTROYERS 

EXPERIENCE 

ACCIDENTAL 

COMMANDEER 

DETACHMENT 

EXPERIMENT 

ACCORDANCE 

COMMANDING 

DETERMINED 

EXPLOSIONS 

ACTIVITIES 

COMMISSARY 

DETONATION 

EXTINGUISH 

ADDITIONAL 

COMMISSION 

DETRAINING 

FACILITIES 

AIRCONTROL 

COMMITMENT 

DETRUCKING 

FLASHLIGHT 

AIRSUPPORT 

COMMUNIQUE 

DIFFERENCE 

FORMATIONS 

ALLEGIANCE 

COMPENSATE 

DIPLOMATIC 

FOUNDATION 

ALLOCATION 

COMPLETELY 

DIRECTIONS 

FOURTEENTH 

AMBASSADOR 

COMPRESSED 

DISCIPLINE 

FRONTLINES 

AMMUNITION 

CONCERNING 

DISCUSSION 

GEOGRAPHIC 

ANTEDATING 

CONCESSION 

DISPATCHED 

GONIOMETER 

ANTICIPATE 

CONCLUSION 

DISPATCHER 

GOVERNMENT 

APPARENTLY 

CONDITIONS 

DISPATCHES 

GYROSCOPIC 

APPEARANCE 

CONFERENCE 

DISPERSION 

HYDROMETER 

APPROACHED 

CONFESSION 

DISTRESSED 

HYGROMETER 

ARMOREDCAR 

CONFIDENCE 

DISTRIBUTE 

ILLITERATE 

ARTIFICIAL 

CONNECTING 

DIVEBOMBER 

ILLUMINATE 

ASPOSSIBLE 

CONNECTION 

DOMINATION 

ILLUSTRATE 

ASSEMBLIES 

CONSPIRACY 

EFFICIENCY 

IMPASSIBLE 

ASSESSMENT 

CONSTITUTE 

EIGHTEENTH 

IMPOSSIBLE 

ASSIGNMENT 

CONTINGENT 

ELEMENTARY 

IMPRESSION 

ASSISTANCE 

CONTINUOUS 

EMPLOYMENT 

IMPRESSIVE 

ATOMICBOMB 

CONTRABAND 

ENCIPHERED 

INCENDIARY 

ATTACHMENT 

CONVENIENT 

ENCIRCLING 

INDICATING 

ATTAINMENT 

COORDINATE 

ENEMYTANKS 

INDICATION 

ATTEMPTING 

CORRECTION 

ENGAGEMENT 

INDIVIDUAL 

AUDIBILITY 

CREDENTIAL 

ENLISTMENT 

INFLICTING 

AUTOMOBILE 

CROSSROADS 

ENROLLMENT 

INSECURITY 

BALLISTICS 

DEBOUCHING 

ENTERPRISE 

INSPECTION 

BATTLESHIP 

DECIPHERED 

ENTRENCHED 

INSTRUCTED 

BEENNEEDED 

DECORATION 

ENTRUCKING 

INSTRUCTOR 

BRIDGEHEAD 

DEDICATION 

EQUIVALENT 

INSTRUMENT 

CAMOUFLAGE 

DEFICIENCY 

ESTIMATION 

INTERNMENT 

CAPABILITY 

DEFINITION 

EVACUATING 

INVITATION 

CASUALTIES 

DEMOBILIZE 

EVACUATION 

IRRIGATION 

CENSORSHIP 

DEPARTMENT 

EVALUATION 

KILOMETERS 

CENTRALIZE 

DEPENDABLE 

EXCAVATION 

LABORATORY 

CIRCUITOUS 

DEPLOYMENT 

EXCITEMENT 

LIEUTENANT 

COASTGUARD 

DEPRESSION 

EXHIBITION 

LIMITATION 

COLLECTING 

DESIGNATED 

EXPEDITING 

LOCOMOTIVE 

COLLECTION 

DESPATCHED 

EXPEDITION 

MACHINEGUN 


HEPTRICTEB - 
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WEDNESDAY 

WITNESSES 

YESTERDAY 


MAINTAINED 

MANAGEMENT 

MECHANIZED 

MEMORANDUM 

MILLIMETER 

MOTORCYCLE 

NATURALIZE 

NAVIGATION 

NEGLIGENCE 

NEWSPAPERS 

NINETEENTH 

OBJECTIVES 

OCCUPATION 

ONEHUNDRED 

OPERATIONS 

OPPOSITION 

OVERCOMING 

PATROLLING 

PERMISSION 

PERSISTENT 

PHOSPHORUS 

POPULATION 

POSSESSION 

POSTOFPICE 

PRECEDENCE 

PREFERENCE 

PRESCRIBED 

PROHIBITED 

PROPORTION 

PROTECTION 

PROVISIONS 

QUARANTINE 

RECEPTACLE 

RECREATION 

RECRUITING 

REENFORCED 

REENLISTED 

REGIMENTAL 

REGULATION 

REINFORCED 

RESISTANCE 

RESPECTFUL 

RESTRICTED 
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REVOLUTION 

SANITATION 

SEPARATION 

SIGNALLING 

SIMILARITY 

STATISTICS 

SUBMARINES 


SUBMISSION 

SUBSTITUTE 

SUCCESSFUL 

SUCCESSIVE 

SUFFICIENT 

SUPPORTING 


SUSPENSION 

SUSPICIONS 

SUSPICIOUS 

THIRTEENTH 

THREATENED 

TRAJECTORY 


TRANSPORTS 

TPAN$VERSE 

TROOPSHIPS 

TWENTYFjCVE 

UNDERSTAND 

UNDERSTOOD 


UNEXPENDED 

UNSUITABLE 

VICTORIOUS 

VISIBILITY 

WILLATTACK 

WITHDRAWAL 


ACCESSORIES 

AERONAUTICS 

ALTERNATING 

APPLICATION 

APPOINTMENT 

APPROACHING 

APPROPRIATE 

APPROXIMATE 

ARBITRATION 

ARMOREDCARS 

ARRANGEMENT 

ASSESSMENTS 

ASSIGNMENTS 

ASSOCIATION 

BATTLEFIELD 

BATTLESHIPS 

BELLIGERENT 

BLOCKBUSTER 

BOMBARDMENT 

CATASTROPHE 

CERTIFICATE 

CIRCULATION 

COEFFICIENT 

COINCIDENCE 

COMMUNICATE 

COMMUNIQUES 

COMPARTMENT 

COMPETITION 

COMPOSITION 

COMPUTATION 

CONCEALMENT 


ADVANTAGEOUS 

AGRICULTURAL 

ANNOUNCEMENT 

ANTIAIRCRAFT 

ANTICIPATION 

BREAKTHROUGH 

CANCELLATION 


ELEVEN LETTER WORDS 


CONCENTRATE 

CONFINEMENT 

CONSTITUTED 

CONSUMPTION 

CONTINENTAL 

CONTROVERSY 

COOPERATION 

CORPORATION 

CORRECTNESS 

CREDENTIALS 

CUSTOMHOUSE 

DEBARKATION 

DEMONSTRATE 

DESCRIPTION 

DESCRIPTIVE 

DESIGNATION 

DESTRUCTION 

DETERIORATE 

DEVELOPMENT 

DISAPPEARED 

DISCONTINUE 

DISCREPANCY 

DISINFECTED 

DISPOSITION 

DISTINCTION 

DISTINGUISH 

DYNAMOMETER 

ECHELONMENT 

EFFECTIVELY 

ELECTRICITY 

EMBARKATION 


EMPLACEMENT 

ENCOUNTERED 

ENEMYPLANES 

ENFORCEMENT 

ENGAGEMENTS 

ENGINEERING 

ESTABLISHED 

ESTIMATEDAT 

EXAMINATION 

EXPLANATION 

EXTENSIVELY 

EXTERMINATE * 

FINGERPRINT , 

FIRECONTROL I 

HEAVYBOMBER 

HEAVYLOSSES ' 

HOSTILITIES 

IMMEDIATELY 

IMMIGRATION 

IMPEDIMENTA 

IMPROVEMENT 

INCOMPETENT 

INDEPENDENT 

INFLAMMABLE 

INFORMATION 

INSPIRATION 

INSTITUTION 

INSTRUCTION 

INSTRUMENTS 

INTELLIGENT 

INTERCEPTED 


INTERfcERPTS 

INTERESTING 

INTERFERING 

INTERPRETER 

INTERRUPTED 

INTERVENING 

INVESTIGATE 

LEGISLATION 

LIGHTBOMBER 

MAINTENANCE 

MANUFACTURE 

MEASUREMENT 

NATIONALISM 

NATIONALITY 

NAVALATTACK 

NAVALBATTLE 

NAVALFORCES 

NECESSITATE 

OBSERVATION 

OVERWHELMED 

PARENTHESIS 

PARENTHESES 

PENETRATION 

PERFORMANCE 

PHILIPPINES 

PHOTOGRAPHY 

PREARRANGED 

PREPARATION 

PRELIMINARY 

PROGRESSIVE 

RANGEFINDER 


TWELVE LETTER WORDS 


CARELESSNESS 

COMMENCEMENT 

COMMENDATION 

COMMISSIONED 

COMMISSIONER 

COMPENSATION 

COMPLETENESS 


CONCENTRATED 

CONCILIATION 

CONFIDENTIAL 

CONFIRMATION 

CONFISCATION 

CONFORMATION 

CONSCRIPTION 


CONSIDERABLE 

CONSTITUTING 

CONSTITUTION 

CONSTRUCTION 

CONTINUATION 

CONVALESCENT 

CONVERSATION 


REAPPOINTED 

RECOGNITION 

RECOMMENDED 

RECONNOITER 

REPLACEMENT 

REQUIREMENT 

REQUISITION 

RESERVATION 

RESIGNATION 

RESPONSIBLE 

RESTRICTION 

RETALIATION 

RETROACTIVE 

SCHOOLHOUSE 

SEVENTEENTH 

SEVENTYFIVE 

SIGNIFICANT 

SMOKESCREEN 

STRATEGICAL 

SUBSISTENCE 

SUITABILITY 

SUPERIORITY 

SURRENDERED 

SYNCHRONIZE 

TEMPERATURE 

THERMOMETER 

TOPOGRAPHIC 

TRADITIONAL 

TRANSFERRED 

WITHDRAWING 


COORDINATION 

DECENTRALIZE 

DECIPHERMENT 

DEMONSTRATED 

DEPARTMENTAL 

DIFFICULTIES 

DISORGANIZED 


n 
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RESTRICTED 


DISPLACEMENT 

DISSEMINATED 

DISTRIBUTING 

DISTRIBUTION 

EMPLACEMENTS 

ENCIPHERMENT 

ENTANGLEMENT 

ENTERPRISING 

FIGHTERPLANE 

GENERALALARM 

GENERALSTAFF 

GEOGRAPHICAL 

HEADQUARTERS 

HEAVYBOMBERS 


ACCOMMODATION 

APPROXIMATELY 

CHRONOLOGICAL 

CIRCUMSTANCES 

COMMUNICATION 

CONCENTRATING 

CONCENTRATION 

CONGRESSIONAL 

CONSIDERATION 


ADMINISTRATION 

ADMINISTRATIVE 

CENTRALIZATION 

CHARACTERISTIC 

CIRCUMSTANTIAL 

CLASSIFICATION 

CORRESPONDENCE 


TWELVE LETTER WORDS—Continued 


HYDROGRAPHIC 

ILLUMINATING 

ILLUMINATION 

ILLUSTRATION 

INAUGURATION 

INCOMPETENCE 

INEFFICIENCY 

INSTRUCTIONS 

INTELLIGENCE 

INTERDICTION 

INTERFERENCE 

INTERMEDIATE 

INTERRUPTION 

INTERVENTION 


INTRODUCTION 

INTRODUCTORY 

IRREGULARITY/ 

LIGHTBOMBERS 

MARKSMANSHIP 

MEASUREMENTS 

MEDIUMBOMBER 

MOBILIZATION 

NONCOMBATANT 

NORTHWESTERN 

OBSTRUCTIONS 

ORGANIZATION 

PREPARATIONS 

PREPAREDNESS 


PRESERVATION 

PRESIDENTIAL 

PROCLAMATION 

PSYCHROMETER 

RADIOSTATION 

RECREATIONAL 

REENLISTMENT 

REGISTRATION 

REPLACEMENTS 

RESPECTFULLY 

ROADJUNCTION 

SATISFACTORY 

SEARCHLIGHTS 

SHARPSHOOTER 


SIGNIFICANCE 

SIMULTANEOUS 

SOUTHWESTERN 

SUBSTITUTION 

SUCCESSFULLY 

TRANSFERRING 

TRANSMISSION 

TRANSPACIFIC 

UNIDENTIFIED 

UNITEDSTATES 

UNSUCCESSFUL 

VERIFICATION 

VETERINARIAN 


THIRTEEN LETTER WORDS 


CORRESPONDING 

COUNTERATTACK 

DECENTRALIZED 

DEMONSTRATION 

DEPENDABILITY 

DETERMINATION 

DISAPPEARANCE 

DISCREPANCIES 

DISSEMINATION 


DISTINGUISHED 

ENTERTAINMENT 

ESTABLISHMENT 

EXTERMINATION 

EXTRAORDINARY 

FIGHTERPLANES 

IMPRACTICABLE 

INDETERMINATE 

INSTALLATIONS 


INSTANTANEOUS 

INTERNATIONAL 

INVESTIGATION 

MEDIUMBOMBERS 

MISCELLANEOUS 

PRELIMINARIES 

QUALIFICATION 

QUARTERMASTER 

REAPPOINTMENT 


REENFORCEMENT 

REIMBURSEMENT 

REINFORCEMENT 

REINSTATEMENT 

REVOLUTIONARY 

SPECIFICATION 

TRANSATLANTIC 

WARDEPARTMENT 


FOURTEEN LETTER WORDS 


DEMOBILIZATION 

DISCONTINUANCE 

DISTINGUISHING 

IDENTIFICATION 

INTERPRETATION 

INVESTIGATIONS 


IRREGULARITIES 

METEOROLOGICAL 

NATURALIZATION 

RECOMMENDATION 

RECONNAISSANCE 

RECONNOITERING 


RECONSTRUCTION 

REORGANIZATION 

REPRESENTATIVE 

RESPONSIBILITY 

SATISFACTORILY 

TRANSPORTATION 
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B. LIST OF WORDS USED IN MILITARY TEXT ARRANGED IN RHYMING ORDER 

ACCORDING TO WORD LENGTH 


THREE LETTER WORDS 


SEA 

SEE 

MAJ 

TAN 

TOP 

lEAT 

APT 

TAX 

JOB 

ACE 

ADJ 

GEN 

GHQ 

•MAT 

BUT 

FIX 

ROB 

SHE 

ASK 

MEN 

BAR 

VAT 

CUT 

MIX 

TUB 

THE 

GAL 

PEN 

CAR 

ACT 

OUT 

SIX 

fQMC 

DIE 

ALL 

TEN 

FAR 

GET 

PUT 

BOX 

ARC 

ONE 

ILL 

PIN 

PAR 

LET 

PVT 

FOX 

BAD 

ARE 

COL 

TIN 

WAR 

NET 

CWT 

DAY 

HAD 

USE 

v CPL 

TON 

HER 

SET 

YOU 

LAY 

ADD 

DUE 

CAM 

WON 

PER 

WET 

CAV 

PAY 

RED 

OWE 

HAM 

DUN 

AIR 

YET 

LAW 

SAY 

AID 

EYE 

AIM 

GUN 

>OR 

SGT 

SAW 

WAY 

BID 

OFF 

HIM 

RUN 

I 1 OUR 

WGT 

FEW 

ANY 

DID 

BAG 

ARM 

SUN 

GAS 

FIT 

NEW 

SPY 

RID 

KEG 

SUM 

OWN 

HAS 

GOT 

HOW 

DRY 

OLD 

BIG 

CAN 

AGO 

WAS 

LOT 

LOW 

TRY 

AND 

JIG 

MAN 

TOO 

HIS 

NOT 

NOW 

BUY 

END 

DOG 

NAN 

TWO 

ITS 





TOUR LETTER WORDS 


AREA 

MIKE 

BASE 

WEEK 

FELL 

JOIN 

PASS 

LIST 

ASIA 

YOKE 

FUSE 

T^K 

Nell 

NOON 

LESS 

LOST 

BULB 

ABLE 

DATE 

BULK 

HILL 

SOON 

MESS 

POST 

BOMB 

FILE 

LATE 

RANK 

WILL 

DOWN 

LOSS 

JUST 

HEAD 

MILE 

NOTE 

SANK 

FULL 

TOWN 

HITS 

ROUT 

LEAD 

MULE 

BLUE 

TANK 

TOOL 

ZERO 

DAYS 

NEXT 

LOAD 

RULE 

HAVE 

SUNK 

TEAM 

ALSO 

MEAT 

TEXT 

ROAD 

SAME 

FIVE 

BOOK 

.THEM 

INTO 

\ THAT 

LIEU 

RAID 

TIME 

LOVE 

COOK 

ITEM 

KEEP 

WHAT 

DRAW 

SAID 

SOME 

WOVE, 

HOOK 

MAIM 

SHIP 

FEET 

XRAY 

HOLD 

LINE 

fuze' 

LOOK 

FROM 

DUMP 

MEET 

AWAY 

HAND 

MINE 

HALF 

TOOK 

FARM 

PUMP 

LEFT 

BODY 

LAND 

NINE 

FLAG 

'DARK 

FIRM 

STOP 

OMIT 

THEY 

KIND 

ZONE 

KING 

PARK 

FORM 

NEAR 

UNIT 

ALLY 

HARD 

JUNE 

LONG ' 

mask 

THAN 

REAR 

HALT 

ONLY 

HERD 

OBOE 

EACH 

TASK 

PLAN 

OVER 

TENT 

JULY 

ONCE 

PIPE 

HIGH 

ORAL 

BEEN 

FOUR 

SHOT 

ARMY 

MADE 

TYPE 

DASH 

MTCL 

SEEN 

EYES 

RIOT 

MANY 

AIDE 

TARE 

pusp 

FEEL 

THEN 

THIS 

DIRT v 

VARY 

SIDE 

HERE 

RUSH 

RAIL 

WHEN 

AXIS 

EAST 

VERY 

CODE 

WERE 

WITH 

CALL 

OPEN 

TONS 

FAST 

EASY 

FLEE 

FIRE 

BOTH 

FALL 

MAIN 

GUNS 

LAST 

CITY 

EDGE 

WIRE 

LEAK 

CELL 

RAIN 

MASS 

WEST 

NAVY 

TAKE 

MORE 

BACK 
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REDTRt 

COMMA 

CTEU 

SCALE 

ALONG 

FIVE LETTER WORDS 

CANAL WAGON 

1 

PRIOR 

DRESS 

START 

ZEBRA 

TITLE 

AMONG 

FATAL 

UNION 

MAJOR 

PRESS 

ALERT 

SQUAD 

ALINE 

BEACH 

VITAL 

COLON 

VALOR 

CROSS 

LEAST 

SPEED 

SLOPE 

REACH 

TOTAL 

DRAWN 

ARMOR 

FLATS 

COAST 

WIPED 

FLARE 

WHICH 

EQUAL 

RADIO 

HONOR 

BOATS 

CREST 

RIGID 

THERE 

MARCH 

USUAL 

EQUIP 

ERROR 

RAFTS 

GUEST 

RAPID 

WHERE 

WEIGH 

NAVAL 

TROOP 

MOTOR 

UNITS 

FIRST 

FIELD 

SHORE 

FRESH 

WHEEL 

GROUP 

AREAS 

TRACT 

BURST 

BLIND 

CEASE 

WIDTH 

STEEL 

CLEAR 

BOMBS 

FLEET 

ABOUT 

GUARD 

ERASE 

FIFTH 

REPEL 

SUGAR 

RAIDS 

QUIET 

ALLOW 

AWARD 

THESE 

TENTH 

LEVEL 

UNDER 

WOODS 

ASSET 

ANNEX 

THIRD 

CLOSE 

NINTH 

APRIL 

ORDER 

YARDS 

SHIFT 

TODAY 

BRIBE 

HORSE 

BOOTH 

SMALL 

DEFER 

MILES 

EIGHT 

DELAY 

PLACE 

CAUSE 

DEPTH 

SHELL 

REFER 

FIRES 

FIGHT 

READY 

VOICE 

HOUSE 

NORTH 

SPELL 

EAGER 

CASES 

LIGHT 

FOGGY 

FORCE 

ROUTE 

SOUTH 

DRILL 

ROGER 

GATES 

NIGHT 

DAILY 

TRUCE 

ISSUE 

SIXTH 

ALARM 

ETHER 

PACKS 

RIGHT 

RALLY 

THREE 

LEAVE 

BREAK 

JAPAN 

OTHER 

DECKS 

SIGHT 

APPLY 

RIDGE 

DRIVE 

BLACK 

QUEEN 

BAKER 

-DOCKS 

AWAIT 

EARLY 

SIEGE 

PROVE 

CHECK 

TAKEN 

LATER 

BANKS 

SPLIT 

ENEMY 

RANGE 

CURVE 

QUICK 

SEVEN 

METER 

TANKS 

LIMIT ’ 

EVERY 

BARGE 

SEIZE 

TRUCK 

GIVEN 

PETER 

PLANS 

VISIT 

FERRY 

LARGE 

CHIEF 

CREEK 

ALIGN 

AFTER 

SHIPS 

AGENT 

FIFTY 

GAUGE 

STAFF 

FLANK 

AGAIN 

ENTER 

CORPS 

POINT 

PARTY 

STAKE 

PROOF 

CLERK 

PLAIN 

RIVER 

FEARS 

FRONT 

FORTY 

SMOKE 

BEING 

LOCAL 

TRAIN 

COVER 

PAIRS 

COUNT 

SIXTY 

BROKE 

GOING 

VOCAL 

BEGIN 

THEIR 

HOURS 

DEPOT 

HEAVY 

CANADA 

HALTED 

DEVICE 

SIX LETTER WORDS 

CHARGE SEVERE 

ARRIVE 

TRENCH 

MANUAL 

ARABIA 

ROUTED 

NOVICE 

GEORGE 

RETIRE 

ACTIVE 

LAUNCH 

ANNUAL 

ALASKA 

LIQUID 

FIERCE 

REFUGE 

ENTIRE 

TWELVE 

SEARCH 

CASUAL 

PANAMA 

INLAND 

REDUCE 

MORALE 

BEFORE 

BREEZE 

CHURCH 

VISUAL 

METRIC 

ISLAND 

PARADE 

UNABLE 

SECURE 

RELIEF 

SWITCH 

CANCEL 

CRITIC 

DEFEND 

DECIDE 

CIRCLE 

ASSURE 

ZIGZAG 

THOUGH 

VESSEL 

BOMBED 

OFFEND 

DIVIDE 

SINGLE 

FUTURE 

RIDING 

FINISH 

DETAIL 

BARBED 

DEPEND 

DECODE 

MOBILE 

GREASE 

FILING 

EIGHTH 

REFILL 

RAIDED 

EXPEND 

ENCODE 

BEETLE 

CHEESE 

LINING 

FOURTH 

ENROLL 

LANDED 

INTEND 

COFFEE 

BATTLE 

ADVISE 

Mining 

ATTACK 

SCHOOL 

WOODED 

EXTEND 

DECREE 

SETTLE 

DEVISE 

FIRING 

DEBARK 

PATROL 

INDEED 

SECOND 

DEGREE 

LITTLE 

OPPOSE 

WIRING 

EMBARK 

PISTOL 

ALLIED 

BEYOND 

STRAFE 

NOZZLE 

COURSE 

DURING 

VERBAL 

SYSTEM 

KILLED 

GROUND 

ENGAGE 

MUZZLE 

REFUSE 

NOTING 

RADIAL 

VICTIM 

FORMED 

METHOD 

DAMAGE 

SCHEME 

LOCATE 

MOVING » 

SERIAL 

SIGCOM 

DOWNED 

PERIOD 

MANAGE 

RESUME 

EXCITE 

FLYING 

ANIMAL 

BOTTOM 

SCORED 

RECORD 

GARAGE 

ENGINE 

MINUTE 

BREACH 

FORMAL 

INFORM 

PASSED 

OFFICE 

BRIDGE 

RAVINE 

RESCUE 

DETACH 

NORMAL 

MEDIUM 

CAUSED 

POLICE 

ALLEGE 

EUROPE 

LEAGUE 

ATTACH 

SIGNAL 

SUDDEN 

UNITED 

ADVICE 

CHANGE 

SPHERE 

PURSUE 

BRANCH 

POSTAL 

SCREEN 




I 


RESTRICT ED 


3-12 



REF ID:A56892 


RESTRICTED 


SIX LETTER WORDS—Continued 


SUNKEN 

MORTAR 

RUNNER 

FORCES 

COLORS 

TARGET 

CANNOT* 

MONDAY 

BROKEN 

RUBBER 

KEEPER 

BARGES 

ACCESS 

PICKET 

ACCEPT 

SUNDAY 

SEAMEN 

MEMBER 

HELPER 

BODIES 

EXCESS 

ROCKET 

EXCEPT 

ANYWAY 

HAPPEN 

BOMBER 

PROPER 

ALLIES 

Unless 

BILLET 

PROMPT 

REMEDY 

BATTEN 

NUMBER 

NEARER 

ARMIES 

stress 

TURRET 

DEPART 

VALLEY 

ELEVEN 

PINCER 

ERASER 

TABLES 

ACROSS 

I SUNSET 

DESERT 

PARLEY 

REMAIN 

LEADER 

CENTER 

PLANES 

A&jS^TS 

1 WEIGHT 

DIVERT 

CONVEY 

ATTAIN 

LADDER 

BETTER 

PASSES 

VISITS 

FLIGHT 

ESCORT 

SURVEY 

WITHIN 

MURDER 

LETTER 

LOSSES 

, POINTS 

SLIGHT 

EFFORT 

VERIFY 

COLUMN 

PREFER 

BITTER 

STATES 

STATUS 

‘NAUGHT 

'report 

SUPPLY 

RATION 

SUFFER 

LITTER 

ROUTES 

ALWAYS 

FOUGHT i 

' ARREST 

HOURLY 

ACTION 

MEAGER 

AFFAIR 

ISSUES 

COMBAT ' 

NOUGHT i 
CREDIT / 
SUBMIT/ 

RESIST 

DEPLOY 

COMMON 

HIGHER 

REPAIR 

CRISIS 

DEFEAT , 

ASSIST 

EMPLOY 

SUMMON 

CIPHER 

HARBOR 

SHELLS 

THREAT 

AUGUST 

CONVOY 

POISON 

EITHER 

TERROR 

SPOOLS 

DEFECT | 

COMMIT/ 

ADJUST 

OCCUPY 

LESSON 

TANKER 

MIRROR 

TRAINS 

EFFECT j 

SUMMIT 

DUGOUT 

SALARY 

PONTON 

HAMMER 

SECTOR 

SPOONS 

REJECT^ , 

RESUl/r 

OUTPUT 

ARMORY 

RETURN 

SUMMER 

VICTOR 

STRIPS 

SELECT, t 

ORIENT 

BUREAU 

NINETY 

DRYRUN 

BANNER 

DOCTOR 

TROOPS 

EXPECT 

INTENT 

REVIEW 

EIGHTY 

TATTOO 

MANNER 

CANVAS 

ORDERS 

DIRECT 

EXTENT 

FOLLOW 

TWENTY 

APPEAR 

DOLLAR 

GUNNER 

PLACES 

OTHERS 

( 'STREET 

1 

INVENT 

FRIDAY 

THIRTY 


SEVEN LETTER WORDS 


MILITIA 

COVERED 

REFUGEE 

WARFARE 

PROMOTE 

► 

FORGING 

VARYING 

ANTENNA 

RETIRED 

WINDAGE 

DECLARE 

dOMMUTE 

FISHING 

ICEBERG 

ALMANAC 

ARMORED 

BAGGAGE 

PREPARE 

REVENUE 

PUSHING ' 

DEBOUCH 

BIVOUAC 

PRESSED 

PACKAGE 

CALIBRE 

RELIEVE 

NOTHING 

THROUGH 

TRAFFIC 

CROSSED 

VILLAGE 

MISFIRE 

Receive 

TALKING 

FURNISH 

PACIFIC 

OMITTED 

TONNAGE 

INSPIRE 

passive 

SINKING 

TWELFTH 

ASIATIC 

DELAYED 

AVERAGE 

REQUIRE 

CAPTIVE 

SMOKING < 

SEVENTH 

REDUCED 

COMMAND 

STORAGE 

INQUIRE 

REVOLVE 

FALLING 

SETBACK 

INVADED 

COMMEND 

BARRAGE 

LECTURE 

APPROVE 

FILLING ' 

DERRICK 

DECIDED 

SUSPEND 

PASSAGE 

RELEASE 

9BSERVE 

KILLING 

DETRUCK 

DECODED 

RESPOND 

MESSAGE 

DISEASE 

■ RESERVE 

RAINING 

ENTRUCK ‘ 

ENCODED 

BOMBARD 

COLLEGE 

SUNRISE 

/ANALYZE 

MANNING 

‘ MEDICAL 

WOUNDED 

AWKWARD 

ARRANGE 

LICENSE 

/JUMPOFF 

RUNNING 

LOGICAL 

GUARDED 

FORWARD 

WITHTHE 

DEFENSE 

1 BOMBING 

MORNING 

CONCEAL 

PROCEED 

REPLACE 

thattHe 

OFFENSE 

PLACING 

SLOPING 

ILLEGAL 

ENGAGED 

SERVICE 

CHARLIE 

PROPOSE , ' 

FORCING 

MAPPING 

marshal 

DAMAGED 

ADVANCE 

PRAIRIE 

SUPPOSE 

HEADING 

Rearing 

INITIAL 

REACHED 

ABSENCE 

VISIBLE 

purpose! 

LEADING 

GASSING 

MARTIAL 

MARCHED 

ENFORCE 

BICYCLE 

REVERSE 

LOADING 

MESSING 

FEDERAL 

WRECKED 

BRIGADE 

HOSTILE 

because/ 

BEDDING 

MISSING 

GENERAL 

SHELLED 

GRENADE 

EXTREME 

MANDATE 

RAIDING 

LIFTING 

SEVERAL 

DROPPED 

PRECEDE 

CONFINE 

RADIATE 

HOLDING 

Halting 

CENTRAL 

STOPPED 

OUTSIDE 

MACHINE 

OPERATE 

LANDING 

GETTING 

NATURAL 

HUNDRED 

INCLUDE 

ROUTINE 

elevAte 

BINDING 

FITTING 

COASTAL 

ORDERED 

EXCLUDE 

Cyclone 

ENTENTE 

FINDING 

ISSUING 

GRADUAL 
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SEVEN LETTER WORDS—Continued 


UNUSUAL 

ENTRAIN 

ENVELOP 

STARTER 

SUCCESS 

ASSAULT 

RAILWAY 

ARRIVAL 

CONTAIN 

SIMILAR 

QUARTER 

USELESS 

INSTANT 

SECRECY 

CHANNEL 

CAPTAIN 

REGULAR 

DELIVER 

ILLNESS 

ELEMENT 

VACANCY 

COLONEL 

CONDEMN 

CALIBER 

RECOVER 

WITNESS 

COMMENT 

SIGNIFY 

COUNCIL 

ABANDON 

OCTOBER 

AVIATOR 

ADDRESS 

CURRENT 

SATISFY 

FUELOIL 

OPINION 

OFFICER 

TRACTOR 

EXPRESS 

PRESENT 

RAPIDLY 

INSTALL 

SESSION 

POUNDER 

VISITOR 

DISMISS 

APPOINT 

QUICKLY 

DISTILL 

MISSION 

TRIGGER 

TACTICS 

DISCUSS 

RECEIPT 

NIGHTLY 

PAYROLL 

STATION 

WEATHER 

ISLANDS 

TARGETS 

ATTEMPT 

SHORTLY 

CONTROL 

SECTION 

WHETHER 

CHANGES 

SURPLUS 

SUPPORT 

COMPANY 

WILLIAM 

ECHELON 

ANOTHER 

ENEMIES 

RETREAT 

SUGGEST 

DESTROY 

DIAGRAM 

BALLOON 

FARTHER 

BATTLES 

EXTRACT 

HIGHEST 

PRIMARY 

PROGRAM 

PLATOON 

FURTHER 

GLASSES 

CONTACT 

NEAREST 

SUMMARY 

MINIMUM 

LIAISON 

SOLDIER 

CHASSIS 

COLLECT 

PROTEST 

LIBRARY 

‘ MAXIMUM 

HORIZON 

CARRIER 

ATTACKS 

RESPECT 

REQUEST 

JANUARY 

HAS5EEN 

EASTERN 

COURIER 

VESSELS 

CORRECT 

AGAINST * 

BRIBERY 

FIFTEEN 

WESTERN 

HEAVIER 

PATROLS 

PROTECT 

OUTPOST 

BATTERY 

SIXTEEN 

FOGHORN 

TRAWLER 

BOMBERS 

INFLICT 

PROVOST 

INQUIRY 

BETWEEN 

UNKNOWN 

STEAMER 

NUMBERS 

CONDUCT 

BOYCOTT 

CAVALRY 

KITCHEN 

TOBACCO 

CLIPPER 

REPAIRS 

TONIGHT 

WITHOUT 

VICTORY 

WRITTEN 

TORPEDO 

CRUISER 

SAILORS 

CIRCUIT 

LOOKOUT 

EMBASSY 

EXPLAIN 

WARSHIP 

AMMETER 

SECTORS 

RECRUIT 

SIMPLEX 

UTILITY 

TERRAIN 

DETRAIN 

DEVELOP 

FIGHTER 

COMPASS 

PURSUIT 

TUESDAY 

SEVENTY 


EIGHT LETTER WORDS 


INSIGNIA 

EXPELLED 

DICTATED 

STANDARD 

LANGUAGE 

ENVELOPE 

OPPOSITE 

SPECIFIC 

ENROLLED 

EFFECTED 

OUTBOARD 

DISLODGE 

INSECURE 

CONTINUE 

TERRIFIC 

DISARMED 

INFECTED 

OUTGUARD 

EXCHANGE 

PRESSURE 

CRITIQUE 

ECONOMIC 

ASSIGNED 

REJECTED 

WINDWARD 

PROBABLE 

DECREASE 

THATHAVE 

MECHANIC 

RETURNED 

SELECTED 

EASTWARD 

SUITABLE 

EXERCISE 

DECISIVE 

ATLANTIC 

APPEARED 

BILLETED 

WESTWARD 

ELIGIBLE 

SURPRISE 

POSITIVE 

RAILHEAD 

DECLARED 

INVENTED 

DESCRIBE 

TERRIBLE 

SUSPENSE 

PRESERVE 

RAILROAD 

PREPARED 

DEPARTED 

ORDNANCE 

POSSIBLE 

DISPERSE 

EQUALIZE 

REPLACED 

HINDERED 

DESERTED 

DISTANCE 

FLEXIBLE 

TRAVERSE 

MOBILIZE 

ADVANCED 

SUFFERED 

ESCORTED 

COMMENCE 

ASSEMBLE 

DEDICATE 

INVADING 

DEMANDED 

CENTERED 

DEPORTED 

SENTENCE 

OBSTACLE 

INDICATE 

DIVIDING 

EXPANDED 

BATTERED 

REPORTED 

ANNOUNCE 

ENCIRCLE 

INITIATE 

BUILDING 

DEFENDED 

LETTERED 

ARRESTED 

COMMERCE 

SCHEDULE 

ESTIMATE 

GUARDING 

OFFENDED 

REPAIRED 

ENLISTED 

ENFILADE 

MARITIME 

ORDINATE 

ENGAGING 

EXPENDED 

REQUIRED 

SURVIVED 

CONCLUDE 

AIRDROME 

DETONATE 

DAMAGING 

EXTENDED 

RESTORED 

IMPROVED 

LATITUDE 

AIRPLANE 

SEPARATE 

MARCHING 

GROUNDED 

DEFERRED 

OBSERVED 

ALTITUDE 

JETPLANE 

EVACUATE 

BREAKING 

BESIEGED 

CAPTURED 

REVIEWED 

EMPLOYEE 

MEDICINE 

EXCAVATE 

FLANKING 

DETACHED 

REPULSED 

DEPLOYED 

CARRIAGE 

DOCTRINE 

OBSOLETE 

TOTALING 

FINISHED 

COMPOSED 

AIRFIELD 

FUSELAGE 

POSTPONE 

COMPLETE 

SHELLING 

OCCUPIED 

MANDATED 

FOOTHOLD 

EQUIPAGE 

SEABORNE 

CONCRETE 

BATTLING 

ATTACKED 

DEFEATED 

THOUSAND 

FRONTAGE 

AIRBORNE 

EXPEDITE 

SWIMMING 

REPELLED 

REPEATED 

SURROUND 

SABOTAGE 

DEVELOPE 

DEFINITE 

TRAINING 
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EIGHT LETTER WORDS—Continued 


PLANNING 

SWEEPING 

SHIPPING 

GROUPING 

ENTERING 

COVERING 

RETIRING' 

ADVISING 

OPPOSING 

DRESSING 

PRESSING 

CROSSING 

DRIFTING 

FIGHTING 

SIGHTING 

LIMITING 

PAINTING 

PRINTING 

SPOTTING 

DELAYING 

RALLYING 

CARRYING 

FERRYING 

APPROACH 

ENTRENCH 

INTRENCH 

RESEARCH 

DESPATCH 

DISPATCH 

SKIRMISH 

DIMINISH 


MEMORANDA 

STRATEGIC 

AUTOMATIC 

PATRIOTIC 

BALLISTIC 

BEACHHEAD 

SPEARHEAD 

DESCRIBED 

ANNOUNCED 

BLOCKADED 

SUCCEEDED 

PROCEEDED 

COMMANDED 

SUSPENDED 

BOMBARDED 

FORTIFIED 


ELEVENTH 

ANTITANK 

CODEBOOK 

CHEMICAL 

CLERICAL 

TACTICAL 

CRITICAL 

NAUTICAL 

OFFICIAL 

MATERIAL 

MEMORIAL 

NATIONAL 

INTERNAL 

CORPORAL 

HOSPITAL 

APPROVAL 

MATERIEL 

PARALLEL 

SENTINEL 

SEALEVEL 

PROTOCOL 

MERCIFUL 

TELEGRAM 

AMERICAN 

EUROPEAN 

CIVILIAN 

HAVEBEEN 

NINETEEN 

EIGHTEEN 

THIRTEEN 

FOURTEEN 


CAMPAIGN 

CHAPLAIN 

MAINTAIN 

MOUNTAIN 

BULLETIN 

INVASION 

DECISION 

DIVISION 

LOCATION 

AVIATION 

CITATION 

TAXATION 

JUNCTION 

IGNITION 

POSITION 

FORENOON 

SQUADRON 

GARRISON 

NORTHERN 

SOUTHERN 

CIRCULAR 

DECEMBER 

REMEMBER 

NOVEMBER 

DEFENDER 

RECORDER 

ENGINEER 

TRANSFER 

DECIPHER 

ENCIPHER 


PRISONER 

IMPROPER 

REPEATER 

DESERTER 

DISASTER 

REGISTER 

CANISTER 

RECEIVER 

REVOLVER 

OBSERVER 

MANEUVER 

EMPLOYER 

HOWITZER 

CORRIDOR 

SUPERIOR 

INTERIOR 

EXTERIOR 

OPERATOR 

DICTATOR 

REJECTOR 

DIRECTOR 

DETECTOR 

ASSOONAS 

POLITICS 

COMMANDS 

ADVANCES 

BARRAGES 

MESSAGES 

REMEDIES 

SUPPLIES 


VEHICLES 

MISFIRES 

DEFENSES 

EXPENSES 

PURPOSES 

RESERVES 

ANALYSIS 

BARRACKS 

MISSIONS 

STATIONS 

FACTIONS 

PONTOONS 

WARSHIPS 

OFFICERS 

SOLDIERS 

CARRIERS 

TRAILERS 

TRAWLERS 

CRUISERS 

FIGHTERS 

QUARTERS 

CARELESS 

WIRELESS 

BUSINESS 

DARKNESS 

CONGRESS 

PROGRESS 

FORTRESS 

DISTRESS 

REDCROSS 


RESPECTS 

ELEMENTS 

ATTEMPTS 

PROTESTS 

OUTPOSTS 

ENORMOUS 

LUMINOUS 

RIGOROUS 

VIGOROUS 

CONTRACT 

INDIRECT 

CONFLICT 

DISTRICT 

INSTRUCT 

AIRCRAFT 

DAYLIGHT 

MIDNIGHT 

PROHIBIT 

SERGEANT 

DOMINANT 

ADJUTANT 

ADJACENT 

INCIDENT 

ARMAMENT 

MOVEMENT 

REGIMENT 

APPARENT 

PASSPORT 

INTEREST 

REENLIST 


WITHDRAW 

WITHDREW 

TOMORROW 

PARALLAX 

SATURDAY 

THURSDAY 

CAUSEWAY 

EFFICACY 

IDENTIFY 

STRATEGY 

PROBABLY 

ASSEMBLE 

ACTUALLY 

MONOPOLY 

EASTERLY 

WESTERLY 

BOUNDARY 

MILITARY 

SANITARY 

FEBRUARY 

CEMETERY 

ADVISORY 

INFANTRY 

CAPACITY 

FATALITY 

CALAMITY 

VICINITY 

PRIORITY 

ACTIVITY 

CASUALTY 


AERODROME 

HURRICANE 

AEROPLANE 

INTERVENE 

FRONTLINE 

DETERMINE 

TELEPHONE 

INTERFERE 

ELSEWHERE 

SHELLFIRE 

THEREFORE 

PROCEDURE 

PREMATURE 

DEPARTURE 

NAVALBASE 

MANGANESE 


CANCELLED 

COMPELLED 

DETRAINED 

ENTRAINED 

CONDEMNED 

ECHELONED 

DEVELOPED 

CONQUERED 

PREFERRED 

CONFERRED 

DECREASED 

INCREASED 

CONDENSED 

COLLAPSED 

DISPERSED 

ADDRESSED 


NINE LETTER WORDS 


IMPRESSED 

DISCUSSED 

INDICATED 

POPULATED 

ESTIMATED 

DOMINATED 

DETONATED 

SUSPECTED 

CORRECTED 

PROTECTED 

INFLICTED 

COMPLETED 

INHABITED 

EXHIBITED 

ASSAULTED 

APPOINTED 


ATTEMPTED 

PROTESTED 

REQUESTED 

SUBMITTED 

CONTINUED 

DESTROYED 

MOTORIZED 

SEMIRIGID 

RECOMMEND 

REARGUARD 

NORTHWARD 

SOUTHWARD 

AMBULANCE 

DOMINANCE 

CLEARANCE 

ENDURANCE 


ASSURANCE 

ALLOWANCE 

INCIDENCE 

REFERENCE 

INFLUENCE 

REENFORCE 

REINFORCE 

LONGITUDE 

COMMITTEE 

ADVANTAGE 

CARTRIDGE 

CHALLENGE 

AVAILABLE 

UNTENABLE 

DIRIGIBLE 

PRINCIPLE 
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CRITICISE 

INTERPOSE 

ASSOCIATE 

IMMEDIATE 

OSCILLATE 

CIRCULATE 

DESIGNATE 

ALTERNATE 

COOPERATE 

ELABORATE 

PENETRATE 

REINSTATE 

CIGARETTE 

PARACHUTE 

DESTITUTE 

TECHNIQUE 

EXPANSIVE 

DEFENSIVE 

OFFENSIVE 

EXPENSIVE 

INTENSIVE 

EXTENSIVE 

EXPLOSIVE 

EXCESSIVE 

INCLUSIVE 

EXCLUSIVE 

TENTATIVE 

DEFECTIVE 

EFFECTIVE 

OBJECTIVE 

INCENTIVE 

EXECUTIVE 

RECOGNIZE 

SERVICING 

ADVANCING 

PRECEDING 

EXTENDING 


ATOMICBOMB 

GEOGRAPHIC 

GYROSCOPIC 

DIPLOMATIC 

BRIDGEHEAD 

PRESCRIBED 

REENFORCED 

REINFORCED 

BEENNEEDED 

UNEXPENDED 


NINE LETTER WORDS—Continued 


REGARDING 

ACCORDING 

INCLUDING 

LAUNCHING 

ATTACKING 

DEBARKING 

REFILLING 

SCREENING 

REMAINING 

OBTAINING 

INCLINING 

BEGINNING 

RETURNING 

PREPARING 

NUMBERING 

CENTERING 

REQUIRING 

OPERATING 

ENLISTING 

RECEIVING 

REVIEWING 

EMPLOYING 

OCCUPYING 

PARAGRAPH 

ESTABLISH 

TWENTIETH 

FIFTEENTH 

SIXTEENTH 

WATERTANK 

TECHNICAL 

CHRONICAL 

PRACTICAL 

POLITICAL 

IDENTICAL 

PRINCIPAL 

DISMISSAL 

CONTINUAL 


PERSONNEL 

CABLEGRAM 

RADIOGRAM 

FIREALARM 

CRITICISM 

MECHANISM 

DIETITIAN 

SEVENTEEN 

SUSPICION 

BATTALION 

REBELLION 

COLLISION 

PROVISION 

EXPANSION 

ASCENSION 

DIMENSION 

EXTENSION 

EXPLOSION 

ADMISSION 

EXCLUSION 

RADIATION 

VARIATION 

INFLATION 

FORMATION 

OPERATION 

SITUATION 

ELEVATION 

OBJECTION 

DIRECTION 

CONDITION 

COALITION 

PARTITION 

DETENTION 

RETENTION 

INTENTION 

ATTENTION 


INVENTION 

PROMOTION 

SEMIC'OLON 

AFTERNOON 

DISAPPEAR 

IRREGULAR 

SEPTEMBER 

COMMANDER 

SURRENDER 

REMAINDER 

PASSENGER 

MESSENGER 

BRIGADIER 

STRAGGLER 

NEWSPAPER 

CHARACTER 

KfLOMETER 

BAROMETER 

GYROMETER 

DESTROYER 

PROJECTOR 

PROTECTOR 

CHAUFFEUR 

LOGISTICS 

STANDARDS 

RESOURCES 

COMPANIES 

BATTERIES 

EMBASSIES 

AIRDROMES 

SEAPLANES 

AIRPLANES 

EXERCISES 

WITNESSES 

ADDRESSES 

ESTIMATES 


CONTINUES 

BUILDINGS 

OFFICIALS 

REPRISALS 

PROPOSALS 

CIVILIANS 

CAMPAIGNS 

MAINTAINS 

DIVISIONS 

MUNITIONS 

POSITIONS 

ENGINEERS 

PRISONERS 

READINESS 

CONFLICTS 

DISTRICTS 

INCIDENTS 

MOVEMENTS 

OUTSKIRTS 

ANONYMOUS 

APPARATUS 

DISINFECT 

INTERDICT 

DIFFICULT 

COMBATANT 

IMPORTANT 

ASSISTANT 

CONFIDENT 

PRESIDENT 

DEPENDENT 

NEGLIGENT 

DEFICIENT 

EFFICIENT 

PLACEMENT 

AGREEMENT 

AMUSEMENT 


'ri’EN LETTER WORDS 


APPROACHED 

ENTRENCHED 

DESPATCHED 

DISPATCHED 

THREATENED 

MAINTAINED 

DETERMINED 

ONEHUNDRED 

DECIPHERED, 

ENCIPHERED 


COMPRESSED 

DISTRESSED 

DESIGNATED 

RESTRICTED 

INSTRUCTED 

PROHIBITED 

REENLISTED 

MECHANIZED 

CONTRABAND 

UNDERSTAND 


UNDERSTOOD 

COASTGUARD 

POSTOFFIQE 

ACCORDANCE 

ALLEGIANCE 

APPEARANCE 

ACCEPTANCE 

RESISTANCE 

ASSISTANCE 

PRECEDENCE 


3-1 6 


STATEMENT 

EQUIPMENT 

GROUPMENT 

INTERMENT 

ALLOTMENT 

PERMANENT 

DIFFERENT 

REPRESENT 

RESTRAINT 

INTERCEPT 

INTERRUPT 

TRANSPORT 

NORTHEAST 

SOUTHEAST 

NORTHWEST 

SOUTHWEST 

INTERVIEW 

YESTERDAY 

WEDNESDAY 

EMERGENCY 

NORTHERLY 

SERIOUSLY 

INSTANTLY 

ACCOMPANY 

ARBITRARY 

NECESSARY 

SECRETARY 

ARTILLERY 

ACCESSORY 

TERRITORY 

LIABILITY 

HOSTILITY 

PROXIMITY 

INDEMNITY 

INTEGRITY 

NECESSITY 


CONFIDENCE 

NEGLIGENCE 

EXPERIENCE 

PREFERENCE 

DIFFERENCE 

CONFERENCE 

CAMOUFLAGE 

DEPENDABLE 

EXPENDABLE 

UNSUITABLE 
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TEN LETTER WORDS—Continued 


ACCEPTABLE 

EVACUATING 

ALLOCATION 

GONIOMETER 

CONTINGENT 

IMPASSIBLE 

COLLECTING 

FOUNDATION 

HYDROMETER 

SUFFICIENT 

IMPOSSIBLE 

CONNECTING 

RECREATION 

HYGROMETER 

CONVENIENT 

ASPOSSIBLE 

INFLICTING 

IRRIGATION 

AMBASSADOR 

EQUIVALENT 

RECEPTACLE 

EXPEDITING 

NAVIGATION 

INSTRUCTOR 

ENGAGEMENT 

MOTORCYCLE 

RECRUITING 

REGULATION | 

BALLISTICS 

MANAGEMENT 

AUTOMOBILE 

ATTEMPTING 

'POPULATION 

STATISTICS 

EXCITEMENT 

DISCIPLINE 

SUPPORTING 

ESTIMATION 

CROSSROADS 

DETACHMENT 

QUARANTINE 

EXTINGUISH 

DOMINATION 

DESPATCHES 

ATTACHMENT 

ENTERPRISE 

NINETEENTH 

DETONATION 

DISPATCHES 

EXPERIMENT 

TRANSVERSE 

EIGHTEENTH 

OCCUPATION 

ASSEMBLIES 

ENROLLMENT 

COORDINATE 

THIRTEENTH 

SEPARATION 

FACILITIES 

ASSIGNMENT 

ILLUMINATE 

FOURTEENTH 

DECORATION 

ACTIVITIES 

ATTAINMENT 

ANTICIPATE 

WILLATTACK 

LIMITATION 

CASUALTIES 

INTERNMENT 

ILLITERATE 

ARTIFICIAL 

SANITATION 

FRONTLINES 

GOVERNMENT 

ILLUSTRATE 

CREDENTIAL 

INVITATION 

SUBMARINES 

ASSESSMENT 

COMPENSATE 

ADDITIONAL 

EVACUATION 

OBJECTIVES 

COMMITMENT 

DISTRIBUTE 

ACCIDENTAL 

EVALUATION 

ENEMYTANKS 

DEPARTMENT 

SUBSTITUTE 

REGIMENTAL 

EXCAVATION 

SUSPICIONS 

ENLISTMENT 

CONSTITUTE 

INDIVIDUAL 

COLLECTION 

COLLISIONS 

INSTRUMENT 

COMMUNIQUE 

WITHDRAWAL 

CONNECTION 

PROVISIONS 

DEPLOYMENT 

TWENTYFIVE 

AIRCONTROL 

INSPECTION 

EXPLOSIONS 

EMPLOYMENT 

SUCCESSIVE 

SUCCESSFUL 

CORRECTION 

FORMATIONS 

PERSISTENT 

IMPRESSIVE 

RESPECTFUL 

PROTECTION 

OPERATIONS 

AIRSUPPORT 

LOCOMOTIVE 

MEMORANDUM 

EXHIBITION 

DIRECTIONS 

CONSPIRACY 

CENTRALIZE 

SUSPENSION 

EXPEDITION 

CONDITIONS 

DEFICIENCY 

NATURALIZE 

DISPERSION 

DEFINITION 

TROOPSHIPS 

EFFICIENCY 

DEMOBILIZE 

CONCESSION 

AMMUNITION 

NEWSPAPERS 

COMPLETELY 

COMMANDING 

CONFESSION 

OPPOSITION ' 

KILOMETERS 

APPARENTLY 

DEBOUCHING 

DEPRESSION 

PROPORTION 

DESTROYERS 

INCENDIARY 

DETRUCKING 

IMPRESSION 

REVOLUTION 

TRANSPORTS 

COMMISSARY 

ENTRUCKING 

POSSESSIQN 

MACHINEGUN 

SUSPICIOUS 

ELEMENTARY 

ENCIRCLING 

SUBMISSION 

BATTLESHIP 

VICTORIOUS 

LABORATORY 

SIGNALLING 

COMMISSION 

CENSORSHIP 

CIRCUITOUS 

TRAJECTORY 

PATROLLING 

PERMISSION 

ARMOREDCAR 

CONTINUOUS 

CAPABILITY 

OVERCOMING 

DISCUSSION 

DIVEBOMBER 

PHOSPHORUS 

AUDIBILITY 

DETRAINING 

CONCLUSION 

COMMANDEER 

FLASHLIGHT 

VISIBILITY 

CONCERNING 

DEDICATION 

DISPATCHER 

COMMANDANT 

SIMILARITY 

INDICATING 

INDICATION 

MILLIMETER 

LIEUTENANT 

INSECURITY 

ANTEDATING 







ELEVEN LETTER WORDS 



IMPEDIMENTA 

SURRENDERED 

CONSTITUTED 

INFLAMMABLE 

CERTIFICATE 

TOPOGRAPHIC 

ENCOUNTERED 

BATTLEFIELD 

RESPONSIBLE 

COMMUNICATE 

RECOMMENDED 

TRANSFERRED 

PERFORMANCE 

NAVALBATTLE 

INVESTIGATE 

PREARRANGED 

DISINFECTED 

MAINTENANCE 

TEMPERATURE 

APPROPRIATE 

ESTABLISHED 

REAPPOINTED 

COINCIDENCE^ 

MANUFACTURE 

APPROXIMATE 

OVERWHELMED 

INTERCEPTED 

SUBSISTENCE 

SCHOOUjpUSE 

EXTERMINATE 

DISAPPEARED 

INTERRUPTED 

CATASTROPHE 

CUSTOMHOUSE 

DETERIORATE 
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CONCENTRATE 

DEMONSTRATE 

NECESSITATE 

DISCONTINUE 

SEVENTYPIVE 

PROGRESSIVE 

RETROACTIVE 

DESCRIPTIVE 

SYNCHRONIZE 

APPROACHING 

INTERVENING 

ENGINEERING 

INTERFERING 

ALTERNATING 

INTERESTING 

WITHDRAWING 

DISTINGUISH 

SEVENTEENTH 

NAVALATTACK 

STRATEGICAL 

TRADITIONAL 

CONTINENTAL 

FIRECONTROL 

NATIONALISM 


TRANSPACIFIC 

HYDROGRAPHIC 

UNIDENTIFIED 

COMMISSIONED 

DISSEMINATED 

CONCENTRATED 

DEMONSTRATED 

DISORGANIZED 

SIGNIFICANCE 

INTELLIGENCE 

INTERFERENCE 

INCOMPETENCE 

CONSIDERABLE 

FIGHTERPLANE 

INTERMEDIATE 

DECENTRALIZE 

GENERALSTAFF 

TRANSFERRING 

ENTERPRISING 

ILLUMINATING 

DISTRIBUTING 


ELEVEN LETTER WORDS—Continued 


SMOKESCREEN 

APPLICATION 

ASSOCIATION 

RETALIATION 

DEBARKATION 

EMBARKATION 

LEGISLATION 

CIRCULATION 

INFORMATION 

EXPLANATION 

DESIGNATION 

RESIGNATION 

EXAMINATION 

PREPARATION 

COOPERATION 

IMMIGRATION 

INSPIRATION 

CORPORATION 

PENETRATION 

ARBITRATION 

COMPUTATION 

OBSERVATION 

RESERVATION 

RESTRICTION 


DISTINCTION 

DESTRUCTION 

INSTRUCTION 

RECOGNITION 

REQUISITION 

COMPOSITION 

DISPOSITION 

COMPETITION 

DESCRIPTION 

CONSUMPTION 

INSTITUTION 

LIGHTBOMBER 

HEAVYBOMBER 

RANGEFINDER 

DYNAMOMETER 

THERMOMETER 

INTERPRETER 

RECONNOITER 

BLOCKBUSTER 

AERONAUTICS 

NAVALFORCES 

ACCESSORIES 

HOSTILITIES 

ENEMYPLANES 


PHILIPPINES 

PARENTHESES 

HEAVYLOSSES 

COMMUNIQUES 

PARENTHESIS 

CREDENTIALS 

BATTLESHIPS 

ARMOREDCARS 

CORRECTNESS 

ENGAGEMENTS 

ASSIGNMENTS 

ASSESSMENTS 

INSTRUMENTS 

INTERCERPTS 

ESTIMATEDAT 

SIGNIFICANT 

INDEPENDENT 

INTELLIGENT 

COEFFICIENT 

BOMBARDMENT 

REPLACEMENT 

EMPLACEMENT 

ENFORCEMENT 

ARRANGEMENT 


TWELVE LETTER WORDS 


CONSTITUTING 

BREAKTHROUGH 

GEOGRAPHICAL 

CONFIDENTIAL 

PRESIDENTIAL 

RECREATIONAL 

AGRICULTURAL 

DEPARTMENTAL 

UNSUCCESSFUL 

GENERALALARM 

VETERINARIAN 

TRANSMISSION 

VERIFICATION 

CONFISCATION 

COMMENDATION 

CONCILIATION 

CANCELLATION 

PROCLAMATION 

CONFIRMATION 

CONFORMATION 

COORDINATION 


ILLUMINATION 

ANTICIPATION 

REGISTRATION 

ILLUSTRATION 

INAUGURATION 

COMPENSATION 

CONVERSATION 

RADIOSTATION 

CONTINUATION 

PRESERVATION 

MOBILIZATION 

ORGANIZATION 

INTERDICTION 

ROADJUNCTION 

INTRODUCTION 

CONSTRUCTION 

INTERVENTION 

CONSCRIPTION 

INTERRUPTION 

DISTRIBUTION 

SUBSTITUTION 


CONSTITUTION 

NORTHWESTERN 

SOUTHWESTERN 

MARKSMANSHIP 

MEDIUMBOMBER 

COMMISSIONER 

PSYCHROMETER 

SHARPSHOOTER 

DIFFICULTIES 

UNITEDSTATES 

PREPARATIONS 

OBSTRUCTIONS 

INSTRUCTIONS 

LIGHTBOMBERS 

HEAVYBOMBERS 

HEADQUARTERS 

PREPAREDNESS 

COMPLETENESS 

CARELESSNESS 

SEARCHLIGHTS 

REPLACEMENTS 


CONFINEMENT 

REQUIREMENT 

MEASUREMENT 

IMPROVEMENT 

CONCEALMENT 

ECHELONMENT 

DEVELOPMENT 

APPOINTMENT 

COMPARTMENT 

BELLIGERENT 

INCOMPETENT 

FINGERPRINT 

DISCREPANCY 

PHOTOGRAPHY 

IMMEDIATELY 

EXTENSIVELY 

EFFECTIVELY 

PRELIMINARY 

CONTROVERSY 

ELECTRICITY 

NATIONALITY 

SUITABILITY 

SUPERIORITY 


EMPLACEMENTS 

MEASUREMENTS 

ADVANTAGEOUS 

SIMULTANEOUS 

ANTIAIRCRAFT 

NONCOMBATANT 

CONVALESCENT 

DISPLACEMENT 

COMMENCEMENT 

ANNOUNCEMENT 

ENTANGLEMENT 

DECIPHERMENT 

ENCIPHERMENT 

REENLISTMENT 

INEFFICIENCY 

SUCCESSFULLY 

RESPECTFULLY 

SATISFACTORY 

INTRODUCTORY 

IRREGULARITY 
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TRANSATLANTIC 

DISTINGUISHED 

DECENTRALIZED 

DISAPPEARANCE 

IMPRACTICABLE 

INDETERMINATE 

CORRESPONDING 

CONCENTRATING 

COUNTERATTACK 


CHARACTERISTIC 

RECONNAISSANCE 

DISCONTINUANCE 

CORRESPONDENCE 

ADMINISTRATIVE 

REPRESENTATIVE 

DISTINGUISHING 


THIRTEEN LETTER WORDS 


CHRONOLOGICAL 

CONGRESSIONAL 

INTERNATIONAL 

SPECIFICATION 

QUALIFICATION 

COMMUNICATION 

ACCOMMODATION 

INVESTIGATION 

DISSEMINATION 


DETERMINATION 

EXTERMINATION 

CONSIDERATION 

CONCENTRATION 

DEMONSTRATION 

QUARTERMASTER 

CIRCUMSTANCES 

DISCREPANCIES 

PRELIMINARIES 


FIGHTERPLANES 

INSTALLATIONS 

MEDIUMBOMBERS 

MISCELLANEOUS 

INSTANTANEOUS 

REENFORCEMENT 

REINFORCEMENT 

REIMBURSEMENT 

REINSTATEMENT 


FOURTEEN LETTER WORDS 


RECONNOITERING 

Meteorological 

CIRCUMSTANTIAL 

CLASSIFICATION 

IDENTIFICATION 

RECOMMENDATION 


ADMINISTRATION 

INTERPRETATION 

TRANSPORTATION 

CENTRALIZATION 

NATURALIZATION 

DEMOBILIZATION 



ESTABLISHMENT 

ENTERTAINMENT 

REAPPOINTMENT 

WARDEPARTMENT 

APPROXIMATELY 

EXTRAORDINARY 

REVOLUTIONARY 

DEPENDABILITY 


REORGANIZATION 

RECONSTRUCTION 

IRREGULARITIES 

INVESTIGATIONS 

SATISFACTORILY 

RESPONSIBILITY 
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C. LIST OF WORDS USED IN MILITARY TEXT ARRANGED ALPHABETICALLY 

ACCORDING TO WORD PATTERN 


A 

CC 

EPT 

A 

CC 

ORDING 

0 

CC 

UPY 

A 

DD 


SU 

DD 

EN 

LA 

DD 

ER 

BE 

DD 

ING 

FL 

EE 


S 

EE 


THR 

EE 


FROC 

EE 

D 

SP 

EE 

D 

CR 

EE 

K 

W 

EE 

K 

F 

EE 

L 

ST 

EE 

L 

WH 

EE 

L 

B 

EE 

N 

FOURT 

EE 

N 

HASB 

EE 

N 

QU 

EE 

N 

SCR 

EE 

N 

S 

EE 

N 

SIXT 

EE 

N 

R 

EE 

NLIST 

K 

EE 

P 

Sff 

EE 

PING 

F 

EE 

T 

FL 

EE 

T 

M 

EE 

T 

JUMPO 

FF 


0 

FF 


STA 

FF 


0 

FF 

END 

SU 

FF 

ER 

TRA 

FF 

IC 

0 

FF 

ICE 

0 

FF 

ICER 

E 

FF 

ORT 

FO 

G6 

Y 

A 

LL 


CA 

LL 


CE 

LL 


DRI 

LL 


ENRO 

LL 



PATTERN AA 

FA 

LL 


FE 

LL 


FU 

LL 


HI 

LL 


I 

LL 


INSTA 

LL 


PAYRO 

LL 


REFI 

LL 


SHE 

LL 


SMA 

LL 

1 

SPE 

LL 


WE 

LL 


WI 

LL 


VI 

LL 

AGE 

CO 

LL 

APSED 

DO 

LL 

AR 

OSCI 

LL 

ATE 

KI 

LL 

ED 

BI 

LL 

ET 

BU 

LL 

ETIN 

VA 

LL 

EY 

A 

LL 

IED 

A 

LL 

IES 

FA 

LL 

ING 

o 

1 

LL 

ING 

SHE 

LL 

ING 

A 

LL 

OW 

A 

LL 

Y 

RA 

LL 

Y 

•CO 

MM 

A 

CO 

MM 

AND 

CO 

MM 

ANDER 

SU 

MM 

AR^f 

CO 

MM 

END 

CO 

MM 

ENiy 

HA 

MM 

ER 

SU 

MM 

ER 

CO 

MM 

IT 

SU 

MM 

IT 

SU 

MM 

ON 

CO 

MM 

UTB/ 

TO 

NN 

AGE 

CHA 

NN 

EL 

BA 

NN 

ER 

GU 

NN 

ER 


MA 

NN 

ER 

A 

NN 

EX 

CA 

NN 

OT 

T 

00 


W 

00 

DS 

PR 

00 

F 

B 

00 

K 

C 

00 

K 

H 

00 

K 

L 

00 

K 

T 

00 

K 

SCH 

00 

L 

T 

00 

L 

PLAT 

00 

N 

S 

00 

N 

TR 

00 

PS 

C 

00 

RDINATE 

B 

00 

TH 

STO 

PP 

ED 

HA 

PP 

EN 

CLI 

PP 

ER 

MA 

PP 

ING 

A 

PP 

LY 

SU 

PP 

LY 

A 

PP 

OINT 

A 

PP 

OINTED 

SU 

PP 

ORT 

SU 

PP 

ORTING 

A 

PP 

ROVE 

TE 

RR 

AIN 

CU 

RR 

ENT 

A 

RR. 

,EST 

HU 

RR 

ICANE 

DE 

RR 

ICK 

GA 

RR 

ISON 

A 

RR 

IVE 

CA 

RR 

Y 

FE 

RR 

Y 

ACRO 

SS 


COMPA 

SS 


CONGRE 

SS 


CRO 

SS 


DARKNE 

SS 


DRE 

SS 


LE 

SS 
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PATTERN 

AA- 

-Continued 




LO 

SS 


A 

SS 

IGNED 

BA 

TT 

EN 

MA 

SS 


CRO 

SS 

ING 

WRI 

TT 

EN 

ME 

SS 


DRE 

SS 

ING 

BI 

TT 

ER 

PA 

SS 


ME 

SS 

ING 

LI 

TT 

ER 

PRE 

SS 


PA 

SS 

IVE 

BA 

TT 

ERY 

UNLE 

SS 


LE 

SS 

ON 

SPO 

TT 

ING 

WITNE 

SS 


I 

SS 

UE 

BA 

TT 

LE 

PA 

SS 

ED 

A 

SS 

URE 

BA 

TT 

LESHIP 

A 

SS 

EMBLY 

EMBA 

SS 

Y 

MU 

ZZ 

LE 

A 

SS 

ET 

OMI 

TT 

ED 

NO 

zz 

LE 

PO 

SS 

IBLE 

SUBMI 

TT 

ED 





MISCELLANEOUS PATTERNS 


AABA 

AGR EEME NT 

AABCB 

SU FFICI ENT 

AABA 

K EEPE R 

AABCB 

A LLEGE 

AABA 

CH EESE 

AABCB 

CO LLEGE 

AABA 

BR EEZE 

AABCB 

BI LLETE D 

AABA 

MA NNIN G 

AABCB 

A MMETE R 

AABA 

PLA NNIN G 

AABCB 

W OODED 

AABA 

RU NNIN G 

AABCB 

TE RRIFI C 

AABA 

L OOKO UT 

AABCB 

BA TTERE D 

AABA 

E RROR 

AABCBDEB 

DI FFERENCE 

AABA 

MI RROR 

AABCC 

A CCESS 

AABA 

TE RROR 

AABCC 

A CCESS ORY 

AABA 

GLA SSES 

AABCC 

CO MMISS ARY 

AABA 

LO SSES 

AABCCB 

WI LLATTA CK 

AABA 

PA SSES 

AABCCDD 

CO MMITTEE 

AABA 

CHA SSIS 

.AjABCCDEFBC 

A CCESSORIES 

AABA 

A SSIS T 

f 

I.ABCDA 

I LLEGAL 

AABAACB 

A SSESSME NT 

t 

ABCDA 

A TTEMPT 

AABAACBDEA 

A SSFSSMENTS 

4 

ABCDAB 

A TTEMPTE D 

AABAB 

PROC EEDED 


ABCDB 

0 FFENSE 

AABB 

CO FFEE 


iABCDB 

CHA LLENGE 

AABB 

BA LLOO N 


(ABCDB 

BA LLISTI C 

AABBAACAC 

B EENNEEDED 

1 

lABCDB 

A RRESTE D 

AABBCBC 

SU CCEEDED 

AABCDB 

PA SSENGE R 

AABCA 

B EETLE 

AABCDB 

BA TTERIE S 

AABCA 

A NNOUN CE 

AABCDBA 

SU RRENDER 

AABCA 

F OOTHO LD 

AABCDBABD 

SU RRENDERED 

AABCA 

CA RRIER 

AABCDBC 

CO MMANDAN T 

AABCA 

A SSETS 

AABCDBD 

0 FFENDED 

AABCA 

I SSUES 

AABCDBEC 

BA LLISTICS 

AABCADEC 

CO MMITMENT 

aIabcdc 

E FFICAC Y 

AABCADEC 

A TTENTION 

AABCDD 

A DDRESS 

AABCADEFEA 

A NNOUNCEMEN T 

/ 

lABCDD 

I LLNESS 

AABCB 

SQ$ EENIN G 

A'ABCDDCA 

A DDRESSED 

AABCB 

SJ FFERE D 

a'abcddcd 

A DDRESSES 

AABCB 

DI FFERE NT 

a!abcdeb 

1 

CO MMUNIQU E 

AABCB 

0 FFICI AL 

AABCDEB 

TR OOPSHIP 
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AABCDEB 

A 

SSEMBLE 

ABA 

INVA 

DED 


AABCDEBC 

TR 

OOPSHIPS 

ABA 

LAN 

DED 


AABCDEC 

CO 

MMANDIN G 

ABA 

RAI 

DED 


AABCDECS 

BA 

TTLEFIEL D 

ABA 

WOUN 

DED 


AABCDED 

CO 

MMANDED 

ABA 


DID 


AABCDEDF(J 

A 

^MUNITION 

ABA 

IC 

EBE 

RG 

AABCDEE 

CO 

MMANDEE R 

ABA 

PR 

ECE 

DING 

AABCDEFA 

R 

EENLISTE D 

ABA 

R 

ECE 

IPT 

AABCDEFA 

I 

RREGULAR 

ABA 

CR 

EDE 

NTIAL 

AABCDEFB 

0 

FFENSIVE 

ABA 

F 

EDE 

RAL 

AABCDEFBA 

A 

SSEMBLIES 

ABA 

D 

EFE 

AT 

AABCDEFC 

A 

LLOTMENT 

ABA 

D 

EFE 

CT 

AABCDEFC 

C 

OOPERATE 

ABA 

D 

EFE 

R 

AABCDEFD 

I 

LLUSTRAT E 

ABA 

SI 

EGE 


AABCDEFD 

A 

SSIGNMEN T 

ABA 

R 

EJE 

CT 

AABCDEFDGA 

A 

SSIGNMENTS 

ABA 

S 

ELE 

CT 

AABCDEFGA 

C 

OOPERATIO N 

ABA 

T 

ELE 

GRAM 

AABCDEFGABF 

R 

EENLISTMENT 

ABA 


ELE 

VATION 

AABCDEFGD 

BA 

TTLESHIPS 

ABA 

SCH 

EME 


AABCDEFGDAE 

C 

OORDINATION 

ABA 

R 

EME 

DY 

AABCDEFGDE 

A 

PPOINTMENT 

ABA 

DISPLAC 

EME 

NT 

ABA 


AGA IN 

ABA 

PLAC 

EME 

NT 

ABA 


AGA INST 

ABA 


ENE 

MY 

ABA 

C 

ALA MITY 

ABA 

G 

ENE 

RAL 

ABA 


ALA RM 

ABA 

R 

EPE 

L 

ABA 

S 

ALA RY 

ABA 

H 

ERE 


ABA 

D 

AMA GE 

ABA 

SPH 

ERE 


ABA 

M 

ANA GE 

ABA 

TH 

ERE 


ABA 

C 

ANA L 

ABA 

W 

ERE 


ABA 


ANA LYZE 

ABA 

WH 

ERE 


ABA 

J 

APA N 

ABA 

CONQU 

ERE 

D 

ABA 

P 

ARA CHUTE 

ABA 

COV 

ERE 

D ' 

ABA 

P 

ARA DE 

ABA 

TH 

ESE 


ABA 

SEP 

ARA TION 

ABA 

PR 

ESE 

NT 

ABA 

F 

ATA L 

ABA 

D 

ESE 

RT 

ABA 

N 

AVA L 

ABA 

COMPL 

ETE 


ABA 

N 

AVA LFORCES 

ABA 

KILOM 

ETE 

R 

ABA 

C 

AVA LRY 

ABA 

M 

ETE 

R 

ABA 

EXC 

AVA TION 

ABA 

P 

ETE 

R 

ABA 


AWA IT 

ABA 

D 

EVE 

LOP 

ABA 


AWA RD 

ABA 

S 

EVE 

N 

ABA 


AWA Y 

ABA 

S 

EVE 

NTH 

ABA 

PRO 

BAB LE 

ABA 

S 

EVE 

NTY 

ABA 

PRO 

BAB LY 

ABA 

S 

EVE 

RAL 

ABA 

BI 

CYC LE 

ABA 


EVE 

RY 

ABA 


CYC LONE 

ABA 


EYE 


ABA 

BLOCKA 

DED 

ABA 


FIF 

TH 

ABA 

GROUN 

DED 

ABA 


FIF 

TY 

ABA 

GUAR 

DED 

ABA 

EIG 

HTH 
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PATTERN AA—Continued 



LO SS 



A SS IGNED 


BA TT EN 

MA SS 



CRO SS ING 


WRI TT EN 

ME SS 



DRE SS ING 


BI TT ER 

PA SS 



ME SS ING 


LI TT ER 

PRE SS 



PA SS IVE 


BA TT ERY 

UNLE SS 



LE SS ON 


SPO TT ING 

WITNE SS 



I SS UE 


BA TT LE 

PA SS 

ED 


A SS URE 


BA TT LESHIP 

A SS 

EMBLY 


EMBA SS Y 


MU ZZ LE 

A SS 

ET 


OMI TT ED 


NO ZZ LE 

PO SS 

IBLE 

SUBMI TT ED 





MISCELLANEOUS PATTERNS 



AABA 


AGR EEME NT 

AABCB 

SU 

FFICI ENT 

AABA 


K EEPE R 

AABCB 

A 

LLEGE 

AABA 


CH EESE 

AABCB 

CO 

LLEGE 

AABA 


BR EEZE 

AABCB 

BI 

LLETE D 

AABA 


MA NNIN G 

AABCB 

A 

MMETE R 

AABA 


PLA NNIN G 

AABCB 

W 

OODED 

AABA 


RU NNIN G 

AABCB 

TE 

RRIFI C 

AABA 


L 00K0 UT 

AABCB 

BA 

TTERE D 

AABA 


E RROR 

AABCBDEB 

DI 

FFERENCE 

AABA 


MI RROR 

AABCC 

A 

CCESS 

AABA 


TE RROR 

AABCC 

A 

CCESS ORY 

AABA 


GLA SSES 

AABCC 

CO 

MMISS ARY 

AABA 


LO SSES 

AABCCB 

WI 

LLATTA CK 

AABA 


PA SSES 

AABCCDD 

CO 

MMITTEE 

AABA 


CHA SSIS 

AABCCDEFBC 

A 

CCESSORIES 

AABA 


A SSIS T 

AABCDA 

I 

LLEGAL 

AABAACB 


A SSESSME NT AABCDA 

A 

TTEMPT 

AABAACBDEA 

A SSFSSMENTS AABCDAB 

A 

TTEMPTE D 

AABAB 


PROC EEDED 

AABCDB 

0 

FFENSE 

AABB 


CO FFEE 

AABCDB 

CHA 

LLENGE 

AABB 


BA LLOO N 

AABCDB 

BA 

LLISTI C 

AABBAACAC 

B EENNEEDED AABCDB 

A 

RRESTE D 

AABBCBC 


SU CCEEDED 

AABCDB 

PA 

SSENGE R 

AABCA 


B EETLE 

AABCDB 

BA 

TTERIE S 

AABCA 


A NNOUN CE 

AABCDBA 

SU 

RRENDER 

AABCA 


F OOTHO LD 

AABCDBABD 

SU 

RRENDERED 

AABCA 


CA RRIER 

AABCDBC 

CO 

MMANDAN T 

AABCA 


A SSETS 

AABCDBD 

0 

FFENDED 

AABCA 


I SSUES 

A.ABCDBEC 

BA 

LLISTICS 

AABCADEC 


SQ MMITMENT 

AlABCDC 

E 

FFICAC Y 

AABCADEC 


A TTENTION 

AABCDD 

A 

DDRESS 

AABCADEFEA 

A NNOUNCEMEN T AABCDD 

I 

LLNESS 

AABCB 


SQ£ BENIN G 

A'ABCDDCA 

A 

DDRESSED 

AABCB 


Sj FFERE D 

a'abcddcd 

A 

DDRESSES 

AABCB 


DI FFERE NT 

AjABCDEB 

CO 

MMUNIQU E 

AABCB 


0 FFICI AL 

AABCDEB 

TR 

OOPSHIP 
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MISCELLANEOUS PATTERNS—Continued 


AABCDEB 

A 

SSEMBLE 

ABA 

INVA 

DED 


AABCDEBC 

TR 

OOPSHIPS 

ABA 

LAN 

DED 


AABCDEC 

CO 

MMANDIN G 

ABA 

RAI 

DED 


AABCDEC? 

BA 

TTLEFIEL D 

ABA 

WOUN 

DED 


AABCDED 

CO 

MMANDED 

ABA 


DID 


AABCDEDFQ 

A 

MMUNITION 

ABA 

IC 

EBE 

RG 

AABCDEE 

CO 

MMANDEE R 

ABA 

PR 

ECE 

DING 

AABCDEFA 

R 

EENLISTE D 

ABA 

R 

ECE 

IPT 

AABCDEFA 

I 

RREGULAR 

ABA 

CR 

EDE 

NTIAL 

AABCDEFB 

0 

FFENSIVE 

ABA 

F 

EDE 

RAL 

AABCDEFBA 

A 

SSEMBLIES 

ABA 

D 

EFE 

AT 

AABCDEFC 

A 

LLOTMENT 

ABA 

D 

EFE 

CT 

AABCDEFC 

C 

OOPERATE 

ABA 

D 

EFE 

R 

AABCDEFD 

I 

LLUSTRAT E 

ABA 

SI 

EGE 


AABCDEFD 

A 

SSIGNMEN T 

ABA 

R 

EJE 

CT 

AABCDEFDGA 

A 

SSIGNMENTS 

ABA 

S 

ELE 

CT 

AABCDEFGA 

C 

OOPERATIO N 

ABA 

T 

ELE 

GRAM 

AABCDEFGABF 

R 

EENLISTMENT 

AJ3A 


ELE 

VATION 

AABCDEFGD 

BA 

TTLESHIPS 

ABA 

SCH 

EME 


AABCDEFGDAE 

C 

OORDINATION 

ABA 

R 

EME 

DY 

AABCDEFGDE 

A 

PPOINTMENT 

ABA 

DISPLAC 

EME 

NT 

ABA 


AGA IN 

ABA 

PLAC 

EME 

NT 

ABA 


AGA INST 

ABA 


ENE 

MY 

ABA 

C 

ALA MITY 

ABA 

G 

ENE 

RAL 

ABA 


ALA RM 

ABA 

R 

EPE 

L 

ABA 

S 

ALA RY 

ABA 

H 

ERE 


ABA 

D 

AMA GE 

ABA 

SPH 

ERE 


ABA 

U 

ANA GE 

ABA 

TH 

ERE 


ABA 

c 

ANA L 

ABA 

W 

ERE 


ABA 


ANA LYZE 

ABA 

WH 

ERE 


ABA 

J 

APA N 

ABA 

CONQU 

ERE 

D 

ABA 

p 

ARA CHUTE 

ABA 

COV 

ERE 

D ' 

ABA 

p 

ARA DE 

ABA 

TH 

ESE 


ABA 

SEP 

ARA TION 

ABA 

PR 

ESE 

NT 

.ABA 

F 

ATA L 

ABA 

D 

ESE 

RT 

ABA 

N 

AVA L 

ABA 

COMPL 

ETE 


ABA 

N 

AVA LFORCES 

ABA 

KILOM 

ETE 

R 

ABA 

C 

AVA LRY 

ABA 

M 

ETE 

R 

ABA 

EXC 

AVA TION 

ABA 

P 

ETE 

R 

ABA 


AWA IT 

ABA 

D 

EVE 

LOP 

ABA 


AffA RD 

ABA 

S 

EVE 

N 

ABA 


AWA Y 

ABA 

S 

EVE 

NTH 

ABA 

PRO 

BAB LE 

ABA 

S 

EVE 

NTY 

ABA 

PRO 

BAB LY 

ABA 

S 

EVE 

RAL 

ABA 

BI 

CYC LE 

ABA 


EVE 

RY 

ABA 


CYC LONE 

ABA 


EYE 


ABA 

BLOCKA 

DED 

ABA 


FIF 

TH 

ABA 

GROUN 

DED 

ABA 


FIF 

TY 

ABA 

GUAR 

DED 

ABA 

EIG 

HTH 
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ABA 

L 

IAI 

SON 

ABA 

CA 

RTR IDGE 

ABA 

PROH 

IBI 

T 

ABA 

D 

RYR UN 

ABA 

SERV, 

ICI 

NG 

ABA 

DI 

SAS TER 

ABA 

RA 

IDI 

NG 

ABA 

CA 

SES 

ABA 

R 

IDI 

NG 

ABA 

RE 

SIS T 

ABA 

R 

IGI 

D 

ABA 


SUS PEND 

ABA 

F 

ILI 

NG ‘ 

ABA 


SYS TEM 

ABA 

M 

ILI 

TARY 

ABA 

S 

TAT ION 

ABA 

MOB 

ILI 

ZE 

ABA 

DIC 

TAT OR 

ABA 

S 

IMI 

LAR 

ABA 


TIT LE 

ABA 

L 

IMI 

T 

ABA 

AL 

TIT UDE 

ABA 

PROX 

IMI 

TY 

ABA 

LA 

TIT UDE 

ABA 

F 

INI 

SH 

ABA 


TOT AL 

ABA 

F 

IRI 

NG 

ABA 


TOT ALING 

ABA 

RET 

IRI 

NG 

ABA 

A 

UGU ST 

ABA 

W 

IRI 

NG 

ABA 


USU AL 

ABA 

V 

ISI 

BLE 

ABA _ 

F 

UTU RE 

ABA 

D 

ISI 

NFECT 

ABA 

SUR 

VIV ED 

ABA 

ADV 

ISI 

NG 

ABAA 

HAV 

EBEE N 

ABA 

DEC 

ISI 

ON 

ABAA 


SESS ION 

ABA 

V 

ISI 

T 

ABAACC 


TATTOO 

ABA 

V 

ISI 

TOR 

ABAB 

DETRA 

ININ G 

ABA 

POL 

ITI 

CS 

■ ABAB 

L 

ININ G 

ABA 

CR 

ITI 

QUE 

'ABAB 

M 

ININ G 

ABA 

POS 

ITI 

VE 

'ABAB 

OBTA 

ININ G 

ABA 


MEM 

ORIAL 

ABAB 

RA 

ININ G 

ABA 


NAN 


ABAB 

REMA 

ININ G 

ABA 

DOMI 

NAN 

CE 

ABAB 

TRA 

ININ G 

ABA 

ORD 

NAN 

CE 

ABAB, 

CR 

ISIS 

ABA 

DOMI 

NAN 

T 

ABAB 

WI 

THTH E 

ABA 


NIN 

E 

ABAB 

PAR 

TITI ON 

ABA 


NIN 

ETY 

ABACA 

C 

ANADA 

ABA 

MOR 

NIN 

G 

ABACA 

P 

ANAMA 

ABA 


NIN 

TH 

ABACA 

PR 

ECEDE 

ABA 


OBO 

E 

ABACA 


ELEME NT 

ABA 

C 

OLO 

N 

ABACA 


ELEME NTARY 

ABA 

SEMIC 

OLO 

N 

ABACA 


ELEVE N 

ABA 

C 

OLO 

RS 

ABACA 

C 

EMETE RY 

ABA 

AUT 

OMO 

BILE 

ABACA 

S 

EVERE 

ABA 

PR 

OMO 

TE 

ABACA 

AUD 

IBILI TY 

ABA 

H 

ONO 

R 

ABACA 

EXH 

IBITI ON 

ABA 

VIG 

ORO 

US 

ABACA 

V 

ICINI TY 

ABA 

M 

OTO 

R 

ABACA 

M 

ILITI A 

ABA 

M 

OTO 

RIZED 

ABACA 

FAC 

ILITI ES 

ABA 

PR 

OVO 

ST 

ABACA 

D 

IMINI SH 

ABA 


PIP 

E 

ABACA 

L 

IMITI NG 

ABA 


POP 

ULATED 

ABACA 


INITI AL 

ABA 

LIB 

RAR 

Y 

ABACA 

DEF 

INITI ON 

ABA 

AI 

RDR 

OME 

ABACA 

D 

IRIGI BLE 
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ABACA 

SEM IRIGI D 

ABACDA 

R EVENUE 

ABACA 

REQU ISITI ON 

ABACDA 

U NKNOWN 

ABACA 

C IVILI AN 

ABACDA'/ 

PR OMOTIO N 

ABACA 

D IVISI ON 

abacdaAc 

S EVENTEEN 

ABACA 

L OCOMO TIVE 

ABACDAACD 

S EVENTEENT H 

ABACA 

M ONOPO LY 

ABACDAC 

D ESERTER 

ABACA 

PR OTOCO L 

ABACDAD 

D EFENSES 

ABACA 

CONS TITUT E 

ABACDAED 

AVAILABL E 

ABACA 

UNUSU AL 

ABACDAEEC 

N AVALBATTL E 

ABACADA 

V ISIBILI TY 

ABACDB 

F ATALIT Y 

ABACADB 

DEF INITION 

ABACDB 

A NONYMO US 

ABACADBA 

PR ECEDENCE 

ABACDB 

C OLONEL 

ABACADC 

INITIAT E 

ABACDBA 

TH EREFORE 

ABACADD 

COMPL ETENESS 

ABACDC 

R ECEIVI NG 

ABACADDA 

N AVALATTA CK 

ABACDC 

DYNA MOMETE R 

ABACADEC 

D IVISIONS 

ABACDCA 

L IMITATI ON 

ABACB 

V ACANC Y 

ABACDCCA 

NINETEEN 

ABACB 

COMB ATANT 

ABACDCCAD 

NINETEENT H 

ABACB 

C ATAST ROPHE 

ABACDCEA 

S TATEMENT 

ABACB 

D ETECT OR 

ABACDCECFGHIE 

M ETEOROLOGICAL 

ABACB 

V ISITS ’ 

ABACDD 

FIFTEE N 

ABACB 

MEMBE R 

ABACDD 

FO RTRESS 

ABACBDEC 

D ETENTION 

ABACDDEC 

FIFTEENT H 

ABACBDEC 

R ETENTION 

ABACDEA 

ELEVATE 

ABACBDEFGFAG 

NONCOMBATANT 

ABACDEA 

D EVELOPE 

ABACC 

R EBELL ION 

ABACDEA 

VER IFICATI ON 

ABACC 

N ECESS ARY 

ABACDEA 

S IMILARI TY 

ABACC 

N ECESS ITY 

ABACDEAD 

SUSPENSE 

ABACC 

CAR ELESS 

ABACDEAFGE 

SUSPENSION 

ABACC 

WIR ELESS 

ABACDEB 

EXPL ANATION 

ABACCA 

P ARALLA X 

ABACDEB 

T OPOGRAP HIC 

ABACCA 

R EPELLE D 

ABACDEBFA 

R ECEPTACLE 

ABACCA 

T OMORRO W 

ABACDEC 

ABANDON 

ABACCDACC 

CAR ELESSNESS 

ABACDEC 

D AMAGING 

ABACCDC 

P ARALLEL 

ABACDEC 

QU ARANTIN E 

ABACCDEFEA 

N ECESSITATE 

ABACDECA 

P ENETRATE 

ABACDA 

' ALASKA 

ABACDECFBA 

D ETERIORATE 

ABACDA 

ARABIA' 

ABACDECFGB 

P ENETRATION 

ABACDA 

N AVALBA £E 

ABACDED 

C APABILI TY 

ABACDA 

R ECEIVE , 

ABACDED 

M OTORCYC LE 

ABACDA 

D ECEMBE R 

ABACDED 

SUSPICI ON 

ABACDA 

D EFENSE 

ABACDEDEDC 

G ENERALALAR M 

ABACDA 

R EJECTE D 

ABACDEDlj’BA 

SUSPICIOUS 

ABACDA 

R ELEASE 1 

ABACDEDFGA 

SUSPICIONS 

ABACDA 

S ELECTE t> 

ABACDEFA 

D EFECTIVE 

ABACDA 

R EMEDIE S 

ABACDEFA 

D EFENSIVE 

ABACDA 

EMERGE NCY 

ABACDEFA 

T ELEPHONE 

ABACDA 

ENEMIE S 

ABACDEFA 

D ETERMINE 

ABACDA 

R EPEATE D 

ABACDEFA 

D EVELOPME NT 
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A^AlCDEFA 

" EXERCISE ' 

ABBA 

SH IPPI NG 

ABACDEFAF" 

" EXERCISES 

ABBA 

M ISSI NG 

ABACDEFB 

DEDICATE 

ABBA 

ADM ISSI ON * 

ABACDEFB 

ENEMYTAN KS 

ABBA 

M ISSI ON 

ABACDEFC 

DEDICATI ON 

ABBA 

PERM ISSI ON 

ABACDEFCDFE 

V ETERINARIAN 

ABBA 

F ITTI NG 

ABACDEFCFD 

ELECTRICIT Y 

ABBA 

AFTER NOON 

ABACDEFD 

SUSPECTE D 

ABBA 

NOON 

ABACDEFDF 

SUSPENDED 

ABBA 

F OLLO If 

ABACDEFE 

ANALYSIS 

ABBA 

C OMMO N 

ABACDEFGA 

EXECUTIVE 

ABBA 

OPPO SE 

ABACDEFGB 

POPULATIO N 

ABBA 

OPPO SITE " 

ABACDEFGBA 

'ENEMYPLANE s 

ABBA • "" 

B OTTO M 

ABACDEFGBA 

S EVENTYFIVE 

ABBAB 

B AGGAG E 

ABACDEFGBEHF 

D ETERMINATION 

ABBAB 

WITN ESSES 

ABACDEFGDHH 

G ENERALSTAFF 

ABBACA 

APPARA TUS 

ABACDEFGE 

MEMORANDA 

abbacA 

L ETTERE D 

ABACDEFGHA 

MEMORANDUM 

ABBACB 

V ESSELS 

ABACDEFGHIA 

D ECENTRALIZE 

ABBACDA 

M ESSENCE R 

ABBA 

AFFA IR 

ABBACDA 

EFFECTE D 

ABBA 

APPA RENT 

ABBACDB 

M ISSIONS 

ABBA 

APPA RENTLY 

abbacdeA 

IRRIGATI ON 

ABBA 

B ARRA CKS 

abbacdedA 

OPPOSITIO N 

ABBA 

B ARRA GE 

abbacdefA 

EFFECTIVE" 

ABBA 

ARRA NGE 

abbacdefA 

D IFFICULTI ES 

ABBA 

P ASSA GE 

ABBACDEFA 

IMMIGRATI ON 

ABBA 

ASSA ULT 

ABBACDEFC$ 

ILLITERATE 

ABBA 

ATTA CH 

abbacdefb! 

ATTAINMENT 

ABBA 

ATTA CK 

ABBACDEFEd 

ARRANGEMEN T 

ABBA 

ATTA IN 

ABBACDEF6# 

ATTACHMENT 

ABBA 

B ATTA LION 

ABBCA 

ANNUA L 

ABBA 

IN DEED 

ABBCA 

APPEA R 

ABBA 

EFFE CT 

ABBCA 

DIS APPEA R 

ABBA 

COMP ELLE D 

ABBCA 

C ARRIA GE 

ABBA 

SH ELLE D 

ABBCA 

S ETTLE 

ABBA 

CONF ERRE D 

ABBCA 

ISSUI NG 

ABBA 

COMPR ESSE D 

ABBCA 

FOUR TEENT H 

ABBA 

IMPR ESSE D 

ABBCA 

SIX TEENT H 

ABBA 

PR ESSE D 

ABBCA 

CHA UFFEU R 

ABBA 

V ESSE L 

ABBCA 

S URROU ND 

ABBA 

CIGAR ETTE 

ABBCADAEF<f 

APPEARANCE 

ABBA 

B ETTE R 

abbcAdAefc 

DIS' APPEARANCE 

ABBA 

L ETTE R 

ABBCADC 

APPEARE D 

ABBA 

D IFFI CULT 

ABBCBBDA 

P OSSESSIO N 

ABBA 

W ILLI AM 

abBcbda 

ASSISTA NCE 

ABBA 

F ILLI NG 

ABBCBDAED 

ASSISTANT 

ABBA 

K ILLI NG 

ABBCCDAB 

ASSOONAS 

ABBA 

REF ILLI NG 

ABBCDA 

ALLOWA NCE 

ABBA 

SW IMMI NG 

ABBCDA 

APPROA CH 
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MISCELLANEOUS PATTERNS—Continued 


ABBCDA 


ARRIVA L 

ABCA 


ADVA 

NCE 

ABBCDA 


ASSURA NCE 

ABCA 

DI 

AGRA 

M 

ABBCDA 

M 

ESSAGE 

ABCA 

EV 

ALUA 

TION 

t ABBCDA 


ILLUMI NATE 

ABCA 


ALWA 

YS 

ABBCDAB 

M 

ESSAGES 

ABCA 

C 

AMPA 

IGN 

ABBCDAB 

C 

ORRIDOR 

ABCA 

M 

ANDA 

TE 

ABBCDAEA 

B 

ELLIGERE NT 

ABCA 

M 

ANUA 

L 

ABBCDAEFC 


ALLOCATIO N 

ABCA 

J 

ANUA 

RY 

ABBCDAEFC 


IMMEDIATE 

ABCA 

C 

ANVA 

S 

ABBCDAEFGAE 


ILLUMINATIN G 

ABCA 

CH 

APLA 

IN 

ABBCDAEFGAHE 


ILLUMINATION 

ABCA 

C 

APTA 

IN 

ABBCDAEFGAHE 

D 

ISSEMINAJION 

ABCA 


AREA 

r 

ABBCDBCEA 


APPROPRIA TE 

ABCA 

DEB 

AREA 

TION 

ABBCDCA 


EFFICIE NT 

ABCA 

EMB 

AREA 

TION 

ABBCDCA 

C 

OLLISIO N 

ABCA 


ASIA 


ABBCDCAED 


EFFICIENC Y 

ABCA 

CO 

ASTA 

L 

ABBCDCAED 

C 

OLLISIONS 

ABCA 

C 

ASUA 

L 

ABBCDCEFA 


ADDITIONA L 

ABCA 

C 

ASUA 

LTY 

ABBCDDCA 

C 

OMMISSIO N 

ABCA 


AVIA 

TOR 

ABBCDDCA 

C 

OMMISSIO NER 

ABCA 


BARB 

ED 

ABBCDDCEAFGC 


ACCOMMODATIO N 

ABCA 


BOMB 


ABBCDEA 


ACCOMPA NY 

ABCA 


BOMB 

ARD 

ABBCDEA 


APPROVA L 

ABCA 

LIGHT, 

BOMB 

ER 

ABBCDEA 


ASSOCIA TE 

ABCA 

BOMB 

ER 

ABBCDEA 

SH 

ELLFIRE 

ABCA 

BRIB 

E 

ABBCDEA 

T 

ERRIBLE 

ABCA 


BULB 


ABBCDEAFB 


ACCORDANC E 

ABCA 


CANC 

EL 

ABBCDEAFB 


REENFORdfe 

ABCA 


CHEC 

K 

ABBCDEAFBC 


ACCEPTANCE 

ABCA 


CIRC 

LE 

ABBCDEAFBGBC 


REENFORCEMEN T 

ABCA 


CIRC 

ULATE 

ABBCDEAFD 


APPLICATI ON 

ABCA 


CONC 

EAL 

ABBCDEAFEC 


ASSOCIATIO N 

ABCA 


CONC 

LUDE 

ABBCDEAFGC 


ACCEPTABLE 

ABCA 

HUN 

DRED 


ABBCDEAFGC 


ALLEGIANCE 

ABCA 

L 

EADE 

R 

ABBCDEAFGHF 

C 

ORRESPONDIN G 

ABCA 


EAGE 

R 

ABBCDEFGA 


ACCIDENTA L 

ABCA 

M 

EAGE 

R 

ABBCDEFGA 


APPROXIMA TE 

ABCA 

S 

EAME 

N 

ABBCDEFGA 


OCCUPATIO N 

ABCA 

ST 

EAME 

R 

ABBCDEFGBAHAC 


IRREGULARITIE S 

ABCA 

N 

EARE 

ST 

ABBCDEFGBA 


IRREGULARI TY 

ABCA 

C 

EASE 


ABBCDEFGEA 


ILLUSTRATI ON 

ABCA 

GR 

EASE 


ABBCDEFGHAD 

C 

OMMENDATION 

ABCA 

INCR 

EASE 

D 

ABCA 

P 

ACKA GE 

ABCA 

L 

EAVE 


ABCA 

EV 

ACUA TING 

ABCA 


ECHE 

LON 

ABCA 

EV 

ACUA TION 

ABCA 

WR 

ECKE 

D 

ABCA 

R 

ADIA L 

ABCA 

INF 

ECTE 

D 

ABCA 

R 

ADIA TE 

ABCA 


EDGE 


ABCA 


ADJA CENT 

ABCA 

S 

EIZE 


ABCA 

GR 

ADUA L 

ABCA 

R 

ELIE 

F 

ttEOTRIOTED 
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ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA ' 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABcj 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA’ 

ABCA 

ABCA 

ABCA 

ABCA 
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H 

ELPE R 

ABCA 

I 

NFAN TRY 

TW 

ELVE 

ABCA 

CO 

NFIN E 

NOV 

EMBE R 

ABCA 

U 

NION 

ABS 

ENCE 

ABCA 

SU 

NKEN 

LIC 

ENSE 

ABCA 

FLA 

NKIN G 

C 

ENtE R 

abcA 

I 

NLAN D 


ENTE R 

ABCA 

' ' I 

NTEN D 


ENVE LOP 

abcA 

CO 

NTIN UAL 

R 

EQUE ST 

ABCA 

CO 

NTIN UE 

FI 

ERCE 

ABCA 

I 

NVEN T 

S 

ER6E ANT 

ABCA 


OCTO BER 

MAT 

ERIE L 

ABCA 

D 

OCTO R 

REV 

ERSE 

ABCA 

F 

OGHO RN 

OBS 

ERVE 

ABCA 

P 

OISO N 

R 

ESPE CT 

ABCA 

C 

OMPO SED 

W 

ESTE RLY 

ABCA 

C 

ONVO Y 

W 

ESTE RN 

ABCA 

EN 

ORMO US 


ETHE “ 

ABCA 

EXPL 

OSIO N 

ft AN 

EUVE R 

ABCA 


PUMP 

R 

EVIE W 

ABCA 


PURP OSE 


EXCE PT 

ABCA 

HA 

RBOR 


EXPE CT 

,abca 

AI 

RBOR NE 


EXPE ND 

ABCA 

MU 

RDER 


EXTE ND 4 

ABCA 

0 

RDER 


gaug'e 

ABCA 

0 

RDER S 


GEOG RAPHIt 

ABCA 


REAR 

FOR 

GING 

ABCA 


RECR UIT 

W 

HICH 

ABCA 

COU 

RIER 


HIGH 

ABCA 

P 

RIOR 


HIGH ER 

ABCA 

SUPE 

RIOR 


HIGH EST 

ABCA 

A 

RMOR 

V 

ICTI M 

ABCA 

A 

RMOR Y 

M 

IDNI GHT 

ABCA 

P 

ROGR AM 

DR 

IFTI NG 

ABCA 

MO 

RTAR 

L 

IFTI NG 

ABCA 

' QUA 

RTER 

S 

IGNI FY 

ABCA 

QUA 

RTER S 

BU 

ILDI NG 

ABCA 

FEB 

RUAR Y 


INDI CATE 

ABCA 

FO 

RWAR D 


INDI RECT 

ABCA 

CEN 

SORS HIP 

DESCR 

IPTI ON 

ABCA, 


SUNS ET 

L 

IQUI D 

ABCA 

IMPOR 

TANT 

A 

IRFI ELD 

ABCA 

S 

TART 


REPR ISAL 

ABCA 

PRO 

TECT 

M 

ISFI RE 

ABCA 


TENT 

F 

ISHI NG 

ABCA 


TENT H 

W 

ITHI N 

ABCA 

PRO 

TEST 

FUE 

LOIL 

ABCA 


TEXT 


MAIM 

ABCA 


THAT 

LA 

NDIN G 

|BCA 

S 

TRAT EGIC 
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ABCA 

S 

TRAT EGY 

ABCAC 

P 

ABCA 

D 

UGOU T 

ABCAC 

PRO 

ABCA 


UNSU ITABLE 

ABCACA 

D 

ABCA 

P 

URSU E 

ABCACB 

0 

ABCA 

P 

URSU IT 

ABCACBDEC 


ABCA 

0 

UTGU ARD 

ABCACDEFD 


ABCAA 

D 

ECREE 

ABCADA 


ABCAA 

D 

EGREE 

ABCADA 

R 

ABCAA 

B 

ETWEE N 

ABCADA 

C 

ABCAA 

DI 

SCUSS 

ABCADA 

B 

ABCAA 

A 

SPOSS IBLE 

ABCADA 

R 

ABCAAB 

P 

ONTOON 

ABCADAB 

CO 

ABCAAB 


THATTH E 

ABCADAC 

S 

ABCAACDEB 

P 

REARRANGE D 

ABCADAC 


ABCAB 

W 

ARFAR E 

ABCADAEC 


ABCAB 

S 

ECREC Y 

abcadAefb 


ABCAB 

OBS 

ERVER 

ABCADAEFC 

S 

ABCAB 

W 

HETHE R 

ABCADfEFCE 

S 

ABCAB 

B 

INDIN G 

ABCADAEFGHF 


ABCAB 

F 

INDIN G 

ABCAD& 


ABCAB 

S 

INKIN G 

ABCADB 


ABCAB 

PA 

INTIN G 

ABCADB 


ABCAB 

PR 

INTIN G 

ABCADB 

w 

ABCAB 

I 

NTENT 

ABCADB 

LOG 

ABCAB 

P 

ONTON 

ABCADB 

EXPL 

ABCAB 

C 

ORPOR AL 

ABCADB 


ABCAB 


RECRE ATION 

ABCADB 

IM 

ABCAB 

P 

RIORI TY 

ABCADB 


ABCAB 

SUPE 

RIORI TY 

ABCADBA 


ABCAB 

DI 

SEASE 

ABCADBC 


ABCAB 

PRO 

TECTE D 

ABCADBCEFCGG 


ABCAB 

PRO 

TESTE D 

ABCADBD 


ABCAB 

0 

UTPUT 

ABCADBEFD 


ABCABA 

INT 

ERFERE 

ABCADG 

R 

ABCABB 

D 

ISMISS 

ABCADC 

ST 

ABCABB 

D 

ISMISS AL 

ABCADC 

V 

ABCABC 


THATHA VE 

ABCADC 


ABCABCA 


ENTENTE 

ABCADC 


ABCABDA 

S 

ENTENCE 

ABCADC 

R 

ABCABDB 


REPRESE NT 

ABCADC 


ABCABDBEFGFHIB 


REPRESENTATIVE 

ABCADC 

I 

ABCABDBEFGFHIED 


REPRESENTATIONS 

ABCADC 


ABCABDC 


RETREAT 

ABCADC 

S 

ABCABDED 

M 

ANGANESE 

ABCADCj 


ABCABDEFA 

C 

ORPORATIO N 

ABCADC^. 

CO 

ABCABDEFGHD 


RECREATIONA L 

ABCADCB 

CO 

ABCAC 


ARMAM ENT 

ABCADCB 


ABCAC 

N 

EARER 

ABCADCEFBGABC 


ABCAC 


PROPO SE 

ABCADCEFGED 
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ABCADCEPGEHC 

CONCENTRATIN G 

ABCADEFA 

ENVELOPE 

ABCADCEFGEHBC 

CONCENTRATION 

ABCADEFA 

EXPEDITE 

ABCADD 

D EPRESS ION / 

ABCADEFA 

EXPERIME NT 

ABCADD 

EXCESS j 

ABCADEFAB 

INDICATIN G 

ABCADD 

D ISTILL I 

ABCADEFAB 

D ISTINGUIS H 

ABCADD 

P OSTOFF ICE / 

ABCADEFABGADE 

D ISTINGUISHING 

ABCADD 

B OYCOTT / 

ABCADEFAGB 

INDICATION 

ABCADDA 

AMBASSA DOR 

ABCADEFB 

ADVANCED 

ABCADDA 

EXPELLE D 

ABCADEFBA 

EXT RAORDINAR Y 

ABCADDECCPA 

UNSUCCESSFU L 

ABCADEFC 

BOMBARDM ENT 

ABCADDEFA 

EXCESSIVE 

ABCADEFC 

CIRCULAR 

ABCADEA 

ADVANTA GE 

ABCADEFC 

U NTENABLE 

ABCADEA 

ADVANTA GEOUS 

ABCADEFCGHB 

RETROACTIVE 

ABCADEA 

D ECREASE 

ABCADEFD 

ADVANCIN G 

ABCADEA 

S EPTEMBE R 

ABCADEFD 

EXTENDIN G 

ABCADEA 

R EQUESTE D 

ABCADEFD 

EXTERIOR 

ABCADEA 

D ISCIPLI NE 

ABCADEFE 

CONCRETE 

ABCADEAB 

CO NTINGENT 

ABCADEFE 

EXPEDITI NG 

ABCADEAE 

EXPENDED 

ABCADEFE 

EXPEDITI ON 

ABCADEAE 

EXPENSES 

ABCADEFE 

OBSOLETE 

ABCADEAE 

EXTENDED 

ABCADEFE 

G ONIOMETE R 

ABCADEAFA 

ELSEWHERE 

ABCADEFE 

PURPOSES 

ABCADEAFGA 

EXPERIENCE 

ABCADEFE 

RECRUITI NG 

ABCADEB 

C ENTERIN G 

ABCADEFEA 

C OMPOSITIO N 

ABCADEB 

ENTERIN G 

ABCADEFGA 

EXPENSIVE 

ABCADEB 

R ESPECTS 

ABCADEFGA 

EXTENSIVE 

ABCADEB 

INCIDEN T 

ABCADEFGAF 

ECHELONMEN T 

ABCADEB 

M ISFIRES 

ABCADEFGB 

C ASUALTIES 

Abcadebce 

INCIDENCE 

ABCADEFGB 

CIRCULATI ON 

ABCADEC 

M ANDATED 

ABCADEFGBC 

CONCLUSION 

ABCADEC 

S ECRETAR Y 

ABCADEFGC 

INDICATED 

ABCADEC 

GYR OSCOPIC 

ABCADEFGC 

S TRATEGICA L 

ABCADECA 

REARGUAR D 

ABCADEFGD 

EXTENSION 

ABCADECAFD 

D ISTINCTION 

ABCADEFGDC 

CONCEALMEN T 

ABCADECFC 

CONCERNIN G 

ABCADEFGE 

REPRISALS 

ABCADEDA 

CO NFINEMEN T 

ABCADEFGF 

BOMBARDED 

ABCADEDAFB 

INVITATION 

ABCADEFGHAB 

C ONFORMATION 

ABCADEDBD 

SUBSTITUT E 

ABCADEFGHCA 

EXTERMINATE 

ABCADEDBDE 

SUBSTITUTI ON 

ABCADEFGHCFIG 

EXTERMINATION 

ABCADEDC 

LI EUTENANT 

ABCADEFGHEIGCF 

REORGANIZATION 

ABCADEDFGA 

ENTERPRISE 

ABCADEFGHH 

R ESPECTFULL Y 

ABCADEDPGDBC 

CONCILIATION 

ABCADEFGHIAJF 

CIRCUMSTANCES 

ABCADEDFGFB 

ENTERPRISIN G 

ABCADEFGHIB 

RETROACTIVE 

ABCADEE 

P ROGRESS 

ABCADEFGHIE 

GEOGRAPHICA L 

ABCADEEBFGHC 

CANCELLATION 

ABCADEFGHIGBH 

CIRCUMSTANTIA L 

ABCADEED 

CANCELLE D 

ABCBA 

COMP LETEL Y 

ABCADEEFBC 

CONCESSION 

ABCBA 

AWKWA RD 

ABCADEEPGD 

P ROGRESSIVE 

ABCBA 

CAPAC ITY 

ABCADEFA 

ECHELONE D 

ABCBA 

PA CIFIC 
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ABCBA 

SPE CIFIC 

ABCBDEBA 

RECEIVER 

ABCBA 

HIN DERED 

ABCBDEBA 

REPEATER 

ABCBA 

DIVID E 

ABCBbEFA 

REJECTOR 

ABCBA 

GARAG E 

ABCBDEFA 

STATIONS 

ABCBA 

C ITATI ON 

ABCBDEFBA 

DEVELOPED 

ABCBA 

LEVEL 

ABCBDEFGA 

R ESISTANCE 

ABCBA 

P REFER 

ABCBDEFGBA 

DETERMINED 

ABCBA . 

REFER 

ABCBDEFGHFA 

DISINFECTED 

ABCBA 

P RESER VATION 

ABCBDEFGHIJBA 

DECENTRALIZED 

ABCBA 

RESER VATION 

ABCCA 


LITTL E 

ABCBA 

TAXAT ION 

ABCCA 


PASSP ORT 

ABCBA 

HOS TILIT Y 

ABCCA 


S TREET 

ABCBA 

U TILIT Y 

ABCCABDEC 

C ROSSROADS 

ABCBA 

AC TIVIT Y 

ABCCBADED 

MILLIMETE R 

ABCBAA 

U SELESS 

ABCCBCA 


BE GINNING 

ABCBAAB 

P REFERRE D 

ABCCBDA 


INF LAMMABL E 

ABCBAB 

DIVIDI NG 

ABCCDA 


COLLEC T 

ABCBAB 

AC TIVITI ES 

ABCCDA 


CORREC T 

ABCBABDEB 

P REFERENCE 

ABCCDA 


T RIGGER 

ABCBABDEB 

REFERENCE 

ABCCDA 


RUBBER 

ABCBADA 

MINIMUM 

ABCCDA 


RUNNER 

ABCBADB 

P RESERVE 

ABCCDA 


SPOOLS 

ABCBADB 

RESERVE 

ABCCDA 


SPOONS 

ABCBADB 

REVERSE 

ABCCDA 


SUGGES T 

ABCBADBC 

RESERVES 

ABCCDA 


SUPPOS E 

ABCBADEB 

SPE CIFICATI ON 

ABCCDA 


TURRET 

ABCBCDBA 

REMEMBER 

ABCCDAA 


SUCCESS 

ABCBDA 

DEFEND 

ABCCDAAEB 

SUCCESSFU L 

ABCBDA 

DEPEND 

ABCCDAAEBFF 

SUCCESSFULL Y 

ABCBDA 

MU NITION S 

ABCCDAAEFD 

SUCCESSIVE 

ABCBDA 

RESEAR CH 

ABCCDAB 


P RESSURE 

ABCBDA 

STATES 

ABCCDAEC 

TERRITOR Y 

ABCBDA 

STATUS 

ABCCDAED 

CORRECTE D 

ABCBDA 

IN TEREST 

ABCCDAEFB 

COLLECTIO N 

ABCBDAB 

DEFENDE R 

ABCCDAEFB 

CORRECTIO N 

ABCBDAB 

E NGAGING 

ABCCDAEFBC 

CONNECTION 

ABCBDABA 

DEFENDED 

ABCCDAEFC 

CONNECTIN G 

ABCBDABD 

DEPENDEN T 

ABCCDAEFDGG 

CORRECTNESS 

ABCBDABDEA 

STATISTICS 

ABCCDEA 


GASSING 

ABCBDAEFGB 

DEPENDABLE 

ABCCDEA 


GETTING 

ABCBDAEFGHG 

DEPENDABILI TY 

ABCCDEA 


ST RAGGLER 

ABCBDCBA 

PARAGRAP H 

ABCCDEA 


IN TERRUPT 

ABCBDDBA 

DEFERRED 

ABCCDEAB 

IN TERRUPTE D 

ABCBDEA 

E CONOMIC 

ABCCDEAD 

COMMENCE 

ABCBDEA 

DAMAGED 

ABCCDEAD 

COMMERCE 

ABCBDEA 

PO LITICAL 

ABCCDEADCDE 

COMMENCEMEN T 

ABCBDEAEC 

MANAGEMEN T 

ABCCDEBFGHDA 

DISSEMINATED 

ABCBDEBA 

DEFEATED 

ABCCDEFA 

COMMUNIC ATE 

ABCBDEBA 

DESERTED 

ABCCDEFA 

SUPPLIES 
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ABCCDEFAGHFBE! 

COMMUNICATION 

ABCDA 


INSPI RE 

ABCCDEFBGHDGAD 

CORRESPONDENCE 

ABCDA 


LOCAL 

ABCCDEFGA 

R EAPPOINTE D 

ABCDA 

LAU 

NCHIN G 

ABCCDEFGHAFG 

R EAPPOINTMENT 

ABCDA 

, co 

NDEMN 

ABCDA 

S ABOTA GE 

ABCDA 

MACHI 

NEGUN 

ABCDA 

R AILWA Y 

ABCDA 


NOTIN G 

ABCDA 

ANIMA L 

ABCDA 

EXPA 

NSION 

ABCDA 

S ANITA RY 

ABCDA 

CO 

NT AIN 

ABCDA 

M ARSHA L 

ABCDA 

MOU 

NTAIN 

ABCDA 

M ARTIA L 

ABCDA 

I 

NTERN AL 

ABCDA 

E ASTWA RD 

ABCDA 

FRO 

NTLIN E 

ABCDA 

N ATURA L 

ABCDA 

I 

NTREN CH 

ABCDA 

N ATURA LIZE 

ABCDA 

C 

ONTRO L 

ABCDA 

TE CHNIC. AL 

ABCDA 

H 

ORIZO N 

ABCDA 

COUNC IL 

ABCDA 


OUTBO ARD 

ABCDA 

R EACHE D 

ABCDA 


PROMP T 

ABCDA 

L EAGUE 

ABCDA 


RECOR D 

ABCDA 

EASTE RLY 

ABCDA 


REPOR T 

ABCDA 

EASTE RN 

ABCDA 


RETUR N 

ABCDA 

W EATHE R 

ABCDA 

P 

RIMAR Y 

ABCDA 

H EAVIE R 

ABCDA 


RIVER 

ABCDA 

INS ECURE 

ABCDA 


ROGER 

ABCDA 

S ECURE 

ABCDA 

FA 

RTHER 

ABCDA 

R EDUCE 

ABCDA 

FU 

RTHER 

ABCDA 

SCH EDULE 

ABCDA, 

NO 

RTHER LY 

ABCDA 

B EFORE 

ABCDA 


SATIS FY 

ABCDA 

R EFUGE 

ABCDA 


SHIPS 

ABCDA 

R EFUSE 

ABCDA 

WAR 

SHIPS 

ABCDA 

R EGIME NT 

ABCDA 


THIRT Y 

ABCDA 

R EGIME NTAL 

ABCDA 

WI 

THOUT 

ABCDA 

EITHE R 

ABCDA 

EX 

TRACT 

ABCDA 

FUS ELAGE 

ABCDA 

\ 

TRACT 

ABCDA 

D ELIVE R 

ABCDA 

INS 

TRUCT 

ABCDA 

GR ENADE 

ABCDA 

DES 

TRUCT ION 

ABCDA 

ERASE 

ABCDA 


TWENT Y 

ABCDA 

OP ERATE 

ABCDA 

B 

UREAU 

ABCDA 

R ESCUE 

ABCDA 


WESTW ARD 

ABCDA 

PR ESIDE NT 

ABCDAA 

R 

EFUGEE 

ABCDA 

R ESUME 

ABCDAA 

C 

ODEBOO K 

ABCDA 

D EVICE 

ABCDAA 

BU 

SINESS 

ABCDA 

D EVISE 

ABCDAA 

DI 

STRESS 

ABCDA 

GOING 

ABCDAA 


STRESS 

ABCDA 

T HOUGH 

ABCDAAD 

F 

ORENOON 

ABCDA 

C HURCH 

ABCDAB 


DECIDE 

ABCDA 

F IGHTI NG 

ABCDAB 


DECODE 

ABCDA 

INFLI CT 

ABCDAB 

SP 

EARHEA D 

ABCDA 

EXT'INGUI SH 

ABCDAB 

R 

EDUCED 

ABCDA 

INQUI RE 

ABCDAB 


ENTREN CH 

ABCDA 

INQUI RY 

ABCDAB 
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ABCDAB 

GEORGE 

ABCDAB 

POSTPO NE 

ABCDAB 

RETIRE 

ABCDAB 

ES TIMATI ON 

ABCDABA 

DECIDED 

ABCDABAB 

INCLININ G 

ABCDABC 

M AINTAIN 

ABCDABC 

M AINTAIN ED 

ABCDABCEFD 

PHOSPHORUS 

ABCDABEFA 

ENTRENCHE D 

ABCDAC 

L ANGUAG E 

ABCDAC 

ANYWAY 

ABCDAC 

GOV ERNMEN T 

ABCDAC 

I NSTANT 

ABCDAC 

I NSTANT LY 

ABCDAC 

DI SPERSE 

ABCDAC 

RES TRICTI ON 

ABCDAC 

PA TRIOTI C 

ABCDACB 

CO NDEMNED 

ABCDACDAEFGB 

I NSTANTANEOUS 

ABCDACEFDAF 

COINCIDENCE 

ABCDAD 

MOVEME NT 

ABCDAD 

A MUSEME NT 

ABCDAD 

RIGORO US 

ABCDADC 

S ANITATI ON 

ABCDADEDAFB 

INSTITUTION 

ABCDADEFEAGC 

ANTIAIRCRAFT 

ABCDAEA 

EXTREME 

ABCDAEA 

MAXIMUM 

ABCDAEAB 

SU ITABILIT Y 

ABCDAEABD 

UNI TEDSTATES 

ABCDAEAE 

PAR ENTHESES 

ABCDAEB 

F IGHTING 

ABCDAEB 

S IGHTING 

ABCDAEB 

RAILROA D 

ABCDAEB 

REPORTE D 

ABCDAEB 

RETURNE D 

ABCDAEB 

TRACTOR 

ABCDAEB 

INS TRUCTOR 

ABCDAEBA 

RECORDER 

ABCDAEBC 

DE TONATION 

ABCDAEBFBDC 

U NIDENTIFIED 

ABCDAEBFC 

SATISFACT ORY 

ABCDAEC 

AVERAGE 

ABCDAEC 

D ISTRICT 

ABCDAEC 

OUTPOST 

ABCDAECA 

TWENTIET H 

ABCDAECAB 

I NTERNMENT 

ABCDAECB 
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ABCDAECD 

L 

ABORATOR Y 

ABCDAECE 


OUTPOSTS 

ABCDAECFD 

EX 

AMINATION 

ABCDAED 

T 

RAVERSE 

ABCDAEE 


ACTUALL Y 

ABCDAEE 


EXPRESS 

ABCDAEE 


THIRTEE N 

ABCDAEEFAB 


THIRTEENTH 

ABCDAEFA 

OV 

ERWHELME D 

ABCDAEFAB 


INFLICTIN G 

ABCDAEFB 

P 

RESCRIBE D 

ABCDAEFBE 

0 

NEHUNDRED 

ABCDAEFC 

M 

ANUFACTU RE 

ABCDAEFC 

PR 

ESIDENTI AL 

ABCDAEFC 

D 

ISTRIBUT E 

ABCDAEFCA 

D 

ISTRIBUTI NG 

ABCDAEFCA 

D 

ISTRIBUTI ON 

ABCDAEFD 

F 

LASHLIGH T 

ABCDAEFD 

C 

ONTROVER SY 

ABCDAEFD 

A 

SCENSION 

ABCDAEFD 


WINDWARD 

ABCDAEFDB 


RESTRICTE D 

ABCDAEFDE 


RESTRICTI ON 

ABCDAEFE 

PAR 

ENTHESIS 

ABCDAEFE 


RETURNIN G 

ABCDAEFEGE 

RE 

SPONSIBILI TY 

ABCDAEFF 


REDCROSS 

ABCDAEFGAHB 


INSPIRATION 

ABCDAEFGC 


REGARDING 

ABCDAEFGD 


RESTRAINT 

ABCDAEFGFE 

TR 

ANSPACIFIC 

ABCDAEFGHC 


TWENTYFIVE 

ABCDAEFGHFBC 


CONSCRIPTION 

ABCDBA 

PR 

ACTICA L 

ABCDBA 

W 

ATERTA NK 

ABCDBA 

DIV 

EBOMBE R 

ABCDBA 


ENGINE 

ABCDBA 

S 

ENTINE L 

ABCDBA 

R 

EVOLVE 

ABCDBA 
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ABCDBCAEB 

INTENTION 

ABCDCEBA 

ELIGIBLE 

ABCDBCEA 

A ERODROME 

ABCDCECA 
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ABCDCECDA 
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ABCDBEA 
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ABCDCEFGAB 
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ABCDEA 

S ERVICE 

ABCDEABFD 

NATIONALI SM 

ABCDEA 

EUROPE 

ABCDEABFDC 
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ABCDEABFE 
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ABCDEABFFGHD 
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AMERICA N 

ABCDECFA 

H 

ENCIRCLE 

ABCDEFA 

C 

AMOUFLA GE 

ABCDECFA 


EVACUATE 

ABCDEFA 


CHRONIC AL 

ABCDECFBA 


SEAPLANES 

ABCDEFA 


CONFLIC T 

ABCDECFEA 


STANDARDS 

ABCDEFA 

DIS 

CREPANC Y 

ABCDEDA 

N 

EWSPAPE R 

ABCDEFA 

S 

EABORNE 

ABCDEDA 


MARITIM E 

ABCDEFA 


EMPLOYE R 

ABCDEDA 

CO 

NTRABAN D 

ABCDEFA 


ENCIPHE R 
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ABCDEFABA 

EMPLACEME NT 

ABCDEFBABGHD 

MEASUREMENTS 

ABCDEFAC ' 

INTEGRIT Y 

ABCDEFBGA 

ENDURANCE 

ABCDEFAC 

P RISONERS 

ABCDEFBGBA 

DECIPHERED 

ABCDEFACB 

IN TRODUCTOR Y 

ABCDEFCA 

ESTIMATE 

ABCDEFACD 

ALTERNATE 

ABCDEFCA 

NORTHERN 

ABCDEFACGF 

ALTERNATIN G 

ABCDEFCAB 

ESTIMATES 

ABCDEFAD 

CONTRACT 

ABCDEFCAD 

D OMINATION 

ABCDEFAD 

D ESTROYER 

ABCDEFCAGFC 

ESTIMATEDAT 

ABCDEFAD 

INTERVIE W 

ABCDEFCBA 

DETONATED 

ABCDEFAD 

OPERATOR 
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AEROPLANE 
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F INGERPRIN T 
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D ETACHMENT 

ABCDEFEAGACE 

R EINSTATEMENT 
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INFLATION 

ABCDEFEAGDB 

CERTIFICATE 
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ABCDEFAGB 

TRAJECTOR Y 

ABCDEFECAE 
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ABCDEFAGBDB 
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UN DERSTOOD 

ABCDEFAGEC 

D EPARTMENTA L 

ABCDEFFA 

IMPRESSI ON 

ABCDEFAGFD 

REGISTRATI ON 
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T ECHNIQUE 
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ABCDEFGA 

E 

NTRUCKIN G / 

ABCDEFGDHFAE 

ORGANIZATION 

ABCDEFGA 


NUMBERIN G / 
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ABCDEFGA 
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IC 

TE 

D- 



QU 

AR 

TE 

RM 

AS 

TE 

R- 




IN 

TE 

RP 

RE 

TE 

R- 




IN 

TE 

RR 

UP 

TE 

D- 



-F 

OR 

TI 

FI 

CA 

TI 

ON 




AB 



— 

_AB 



AR 

MO 

RE 

DC 

AR 



EN 

FO 

RC 

EM 

EN 

T- 

RE 

EN 

FO 

RC 

EM 

EN 

TS 


IN 

DE 

TE 

RM 

IN 

AT E- 


IN 

TE 

RE 

ST 

IN 

G— 


IN 

TE 

RF 

ER 

IN 

Q— 


IN 

TE 

RV 

EN 

IN 

G- 

-I 

NC 

OM 

PE 

TE 

NC 

E- 

-C 

ON 

GR 

ES 

SI 

ON 

AL 

-D EM 

ON 

ST 

RA 

TI 

ON 


-C 

ON 

SU 

MP 

TI 

ON 



PH 

OT 

OG 

RA 

PH 



TH 

IR 

TE 

EN 

TH 



AB-AB 


-I 

-C 

-C 

-C 


NS TA LL AT 10 NS 
ON CE NT RA TI ON 
ON FL AG RA TI ON 
ON SI DE RA TI ON 


AB — AB AB 


MA 


IN CL IN IN 
IN TA IN IN 


G- 

G- 


RE3TIUCTEB 
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£ DIGRAPHIC IDIOMORPHS PLAYFAIR 


n 


AB BA 


sc 

AB 

BA 

RD 


SH 

EL 

LE 

D- 


AF 

FA 

BL 

E- 

' -H 

EM 

ME 

DI N- 


AF 

FA 

IR 


ST 

EM 

ME 

D- 

-B 

AG 

GA 

GE 


ST 

EP 

PE 

D- 

-H AW 

AI 

IA 

N- 


AV 

ER 

RE 

D- 


AL 

LA 

RE 

AS 

1 CO NF 

ER 

RE 

D- 

-B 

AL 

LA 

ST 


1 -I NT 

ER 

RE 

D- 

-F 

AL 

LA 

CY 


, *-R EF 

ER 

RE 

D- 

IN ST 

AL 

LA 

TI 

ON S- 

’ 1 

. i 

ES 

SE 

NC E- 

-P AR 

AL 

LA 

X- 


1 

i 

ES 

SE 

NT IA 


AP 

PA 

RA 

TU S- 

Ad dr 

ES 

SE 

S- 


AP 

PA 

RE 

L— 

-C OM PR 

ES 

SE 

D- 


AP 

PA 

RE 

NT 

CO NF 

ES 

SE 

D- 


AP 

PA 

RE 

NT LY 

IM PR 

ES 

SE 

D- 


AR 

RA 

NG 

E- 

[ —L 

ES 

SE 

N- 


AR 

RA 

Y- 


-M 

ES 

SE 

NG ER 

-B 

AR 

RA 

CK 

S- 

PR 

ES 

SE 

D- 

-B 

AR 

RA 

GE 

l 

PR OF 

ES 

SE 

D- 

-E MB 

AR 

RA 

SX 

SE D- 

-P RO GR 

ES 

SE 

D- 

-N 

AR 

RA 

TI 

ON 

-S TR 

ES 

SE 

D- 


AS 

SA 

IL 

AN T- 

-S TR 

ES 

SE 

S- 


AS 

SA 

UL 

T- 

-V 

ES 

SE 

L— 

-A MB 

AS 

SA 

DO 

R- 

WI TN 

ES 

SE 

s- 

-I MP 

AS 

SA 

BL 

E— 

AB 

ET 

TE 

D- 

-M 

AS 

SA 

CR 

E- 

-C IG AR 

ET 

TE 

S- 

-P 

AS 

SA 

GE 


-B 

ET 

TE 

R- 


AT 

TA 

CH 


-L 

ET 

TE 

R- 


AT 

TA 

CK 


-E IG 

HT 

TH 

RE E- 


AT 

TA 

IN 


-R 

IB 

BI 

NG 

-B 

AT 

TA 

LI 

ON 

FO RB 

ID 

DI 

NG 

-R 

AT 

TA 

N- 


-D 

IF 

FI 

CU LT 


BO 

OB 

YT 

RA P- 

-B 

IL 

LI 

ON 

IN 

DE 

ED 



-F 

IL 

LI 

NG 

-W 

EB 

BE 

D- 


-K 

IL 

LI 

NG 


EF 

FE 

CT 


-M 

IL 

LI 

ME TE 


EF 

FE 

CT 

IV E- 

-M 

IL 

LI 

NG 

CO MP 

EL 

LE 

D- 


-M 

IL 

LI 

ON 

-E XC 

EL 

LE 

NC 

E— 

SH 

IL 

LI 

NG 

-E XC 

EL 

LE 

NT 


SP 

IL 

LI 

NG 

-E XP 

EL 

LE 

D- 


-T 

IL 

LI 

NG 

-I MP 

EL 

LE 

D- 


-W 

IL 

LI 

AM 

-P 

EL 

LE 

T- 


~W 

IL 

LI 

NG 

PR OP 

EL 

LE 

D- 



IM 

MI 

GR AN 

-R EP 

EL 

LE 

D- 



IM 

MI 

GR AT 
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j^EDTRICTE B- * 





AB 

BA 






AB 

— 

BA 






IM 

MI 

NE 

NT 



PR 

AC 

TI 

CA 

BL 

E- 



SW 

IM 

MI 

NG 




PR 

AC 

TI 

CA 

L- 



-B 

EG 

IN 

NI 

NG 




-T 

AC 

TI 

CA 

L- 




SP 

IN 

NI 

NG 



-D 

IV 

EB 

OM 

BE 

R- 




-W 

IN 

NI 

NG 





EN 

GI 

NE 

ER 




CL 

IP 

PI 

NG 




-G 

EN 

UI 

NE 





SH 

IP 

PI 

NG 



-I 

NT 

ER 

FE 

RE 




-S 

TR 

IP 

PI 

NG 



-I 

NT 

ER 

FE 

RE 

NC 

E- 




IR 

RI 

GA 

TI 

ON 

-P 

EN 

ET 

RA 

TE 





-M 

IS 

SI 

NG 




-R 

EV 

OL 

VE 

R- 




-M 

IS 

SI 

ON 





IN 

FI 

NI 

TE 



-A 

DM 

IS 

SI 

ON 




-D 

IS 

PO 

SI 

TI 

ON 



EM 

IS 

SI 

ON 




-S 

IT 

UA 

TI 

ON 




-H 

IS 

SI 

NG 




CA 

NA 

DI 

AN 




PE 

RM 

IS 

SI 

ON 



VE TE 

RI 

NA 

RI 

AN 



TR 

AN 

SM 

IS 

SI 

ON 




NI 

NE 

TE 

EN 





EM 

IT 

TI 

NG 




NI 

NE 

TE 

EN 

TH 




-F 

IT 

TI 

NG 





PE 

RC 

EP 

TI 

ON 


-S 

PL 

IT 

TI 

NG 




-P 

RE 

MI 

ER 




PE 

RM 

IT 

TI 

NG 



—s 

UR 

RE 

ND 

ER 



-A 

FT 

ER 

NO 

ON 




-0 

UR 

SE 

LV 

ES 




FO 

RE 

NO 

ON 




TH 

EM 

SE 

LV 

ES 






NO 

ON 

TI 

ME 



DE 

SE 

RV 

ES 





-F 

OL 

LO 

W- 




RE 

SE 

RV 

ES 





-H 

OL 

LO 

W- 





SE 

RV 

ES 





-C 

OM 

MO 

N- 












-C 

OM 

MO 

TI 

ON 









PO 

SI 

TI 

ON 

NO 

RT 

HO 

F- 









-R 

EC 

ON 

NO 

IT 

ER 












OP 

PO 

RT 

UN 

E- 











OP 

PO 

RT 

UN 

IT Y- 











OP 

PO 

SE 













OP 

PO 

SI 

TE 












OP 

PO 

SI 

TI 

ON 










-C 

OR 

RO 

BO 

RA 

TE 










-C 

OR 

RO 

DE 











-T 

CM 

OR 

RO 

W- 












-B 

OT 

TO 

M- 












-C 

OT 

TO 

N- 












CA 

RE 

ER 













-S 

UC 

CU 

MB 

ED 
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AB 

— 

— 

BA 

DE 

BA 

RK 

ED 


DE 

CL 

AR 

ED 


DE 

FE 

ND 

ED 


DE 

MA 

ND 

ED 


DE 

PA 

RT 

ED 


DE 

PL 

OY 

ED 


DE 

PO 

RT 

ED 


DE 

SE 

RT 

ED 


DE 

TA 

CH 

ED 


EC 

ED 

EN 

CE 


EM 

PL 

OY 

ME 

NT 

EN 

TR 

AI 

NE 

D- 

ME 

AS 

UR 

EM 

EN T- 

NE 

GL 

IG 

EN 

CE 

NO 

TA 

TI 

ON 


PA 

RA 

GR 

AP 

H- 

RE 

CE 

IV 

ER 


RE 

CO 

RD 

ER 


RE 

GI 

ST 

ER 


RE 

PE 

AT 

ER 


RE 

PO 

RT 

ER 


RE 

VO 

LV 

ER 


RO 

JE 

CT 

OR 


SE 

MB 

LI 

ES 



-D 

*—s 



AB 

— 

— 

— 

BA 

DE 

SE 

CR 

AT 

ED 


DE 

SI 

GN 

AT 

ED 


DE 

SP 

AT 

CH 

ED 


EN 

EM 

YP 

LA 

NE 

S- 

!ET 

ER 

10 

RA 

TE 


!ev 

EN 

TY 

FI 

VE 


IR 

RE 

GU 

LA 

RI 

TY 

NO 

MI 

NA 

TI 

ON 


SU 

SP 

IC 

10 

US 



AB-BA 


DE MO NS TR AT ED 
NO TI FI CA TI ON 


t 
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F. DIGRAPHIC IDIOMORPHS: FOUR-SQUABE 1 


(Grouped by number of significant letters in the ldiomorphic pattern) 


Two letters 





A- 

A- 





A- 

A- 





A- 

— 


A- 




B 

10 

CK 

AD 

ED 





SQ 

UA 

DR 

ON 



MO" 

VE 

'ME 

NT 





I 

HV 

AD 

ED 



FI 

GH 

TE 

RP 

LA 

NE 


E 

MP 

LA 

CE 

ME 

NT 





D 

AM 

AG 

E 



MO 

TO 

RI 

ZE 

D 



PE 

ES 

ON 

NE 

L 




CO 

MM 

AN 

DS 



D EP 

AR 

TU 

HE 




A 

ET 

IL 

LE 

RY 





I 

SL 

AH 

DS 





UN 

US 

UA 

L 










A 

IR 

PL 

AN 

ES 











A- 

— 

-- 

— 

A- 



E RE 

my 

PL 

AN 

ES 




A- 

- ~ 

A- 





CO 

MM 

UN 

IC 

AT 

10 

NS 

DE 

SI 

GN 

AT 

ED 



S 

AB 

w 

AG 

E 




CO 

NC 

EN 

TR 

AT 

E 


E 

ST 

IM 

AT 

ED 



D ET 

AC 

EM 

EN 

T 



R 

EO 

PG 

AN 

IZ 

AT 

10 

N 

I 

HD 

IC 

AT 

ED 



H 

AS 

BE 

EN 





LI 

EU 

TE 

NA 

NT 





C 

AV 

AL 

RY 



BA 

TT 

AL 

10 

N 


CO 

NS 

TR 

UC 

TI 

ON 





H 

AV 

AL 




BO 

MB 

ED 













P 

RO 

CE 

DU 

EE 



CA 

SU 

AL 

TI 

ES 



A- 

— 

-- 



A- 




ME 

CH 

AN 

IZ 

ED 


CA 

SU 

AL 

TY 




CO 

MM 

IS 

si" 

ON 

ED 



IM 

ME 

DI 

AT 

EL 

Y 


CO 

MB 

AT 













WI 

TH 

DR 

ATT 




CO 

OR 

DI 

NA 

TE 

s 




-B 

-B 





WI 

TH 

DR 

EW 


1 


DI 

RE 

CT 

10 

N 




UN 

AB 

LE 







EM 

ER 

GE 

NC Y 


DI 

SP 

AT 

CH 




OB 

ST 

AC 

IE 




L 

IE 

UT 

EN 

AH 

T 

I 

ME 

DI 

UM 

BO 

MB 

ER 





AD 

VA 

NC 

E 





FI 

FT 

EE 

N 


DI 

VE 

BO 

MB 

ER 





AG 

AI 

NS 

T 





FI 

FT 

H 


R OA 

DJ 

UN 

CT 

10 

N 




R 

AI 

LH 

EA 

D 





FI 

FT 

Y 

j 

R 

EP 

LA 

CE 

ME 

NT 



PR 

EP 

AR 

AT 

10 

N 



BR 

ID 

GE 

HE 

AD 


R 

ET 

RE 

AT 





A 

SS 

AU 

LT 






V 

IC 

IN 

IP 

Y 

S 

EV 

ER 

AL 





B 

OM 

BA 

RD 






W 

IT 

HD 

RA 

W 

JU 

NC 

TI 

ON 





A 

IR 

BO 

EN 

E 




A 

DD 

IT 

10 

NA 

L 

CO 

NF 

IR 

MA 

TI 

ON 



S 

EA 

BO 

RN 

E 



A 

MM 

UN 

IT 

10 

N 


I 

NF 

OR 

MA 

TI 

ON 


A 

DV 

AN 

Cl 

NG 





CO 

HD 

IT 

10 

N 


I 

NT 

EL 

LI 

GE 

NC 

E 



VI 

Cl 

HI 

TY 



RE 

CO 

GN 

IT 

10 

N 



PA 

TR 

OL 







DE 

TA 

CH 





E 

LE 

MB 

NT 



SA 

BO 

TA 

GE 






DE 

TA 

CH 

ME 

NT 




MI 

LI 

TA 

RY 


SE 

VE 

RE 





H 

AV 

EB 

EE 

N 






MI 

HI 

MU 

M 

AC 

TI 

VI 

TY 





M 

OV 

EM 

EN 

T 






HI 

NT 

H 


A 

TT 

EN 

TI 

ON 






EN 

EM 

Y 





P 

01 

NT 



S 

UC 

CE 

SS 

FU 

LL 

Y 



R 

ES 

ER 

VE 





T 

OM 

OR 

RO 

W | 










R 

ET 

UR 

N 






PO 

NT 

ON 



A- 

-- 

-- 

A- 






FL 

AN 

K 






RE 

QU 

ES 

T 


AR 

TI 

LL 

ER 

Y 





FO 

LL 

ON 






RE 

QU 

IR 

E 


AT 

TA 

CK 

ED 




B 

AG 

'r ft 

GE 






P 

RI 

SO 

NE 

R 

R 

EE 

NF 

OR 

CE 






HA 

SB 

EE 

N 




RE 

SI 

ST 

AN 

CE 

R 

EE 

NF 

OR 

CE 

ME 

NT 

A PP 

RO 

AC 

HI 

LO¬ 




D 

IS 

PO 

SI 

TI 

ON 

1 


ID 

EN 

TI 

FY 



DE 

BO 

UC 

HI 

NG 






PO 

SI 

TI 

ON 

j 


IM 

PA 

SS 

IB 

LE 


L 

AU 

NC 

HI 

NG 







SO 

UT 

H 

j 


IM 

PO 

SS 

IB 

LE 


I 

MM 

ED 

IA 

TE 

LY 




See subpar. 
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Two letters (cont.) 


-B -B 

-B -B 

-B-B 

IN IT IA TE 

P ER SO HN' EL 

I DE NT IF IC AT 10 N 

F IP TH 

ES TI MA TE DA T 

M EC HA NI ZE D 

TE RR IT OR Y 

P LA TO ON 

D EP LO YM EN T 

S IX TY 

S UP PL Y 

M ES SE NG ER 

M IS CE LL AN EO US 

S UP PO RT 

D ES TR OY ER 

E IE VA TI ON 

NA VA LB AS E 

A IR SU PP OR T 

E IE VE N 

F OR WA RD 

V IS LB IL IT Y 

LI AI SO N 

UI ND WA RD 

ME SS EN GE R 

BA MA GE 


I MP AS SI BL E 

MO RN IN G 

I 

I 

I MP OS SI BL E 

U NU SU AL 

C AS UA IT Y 

A NT IA IR CR AF T 

OB JE CT IV E 

P AT RO LS 1 

C OM MA ND IN G 

C OL ON 

B AT TL ES HI PS . 

OP ER AT 10 N 

C OL ON EL 

GE NE RA L 

PR IS ON ER 

SU PE RI OR IT Y 

W IL LA TT AC K 

PR OC ED UR E 

M OT OR IZ ED 

T RA NS MI SSllO N 

RE EN FO RC E 

OU TS KI RT S 

R EC OG NI TI ON 

TR AN SP OR TA TI ON 

EQ UI PM EN T 

T RO OP SH IP 

YE ST ER DA Y 

A VE RA GE 

RE GI ME NT 


B AR RA GE 

CA RR IE RS' 

-B.-B 

AI RC RA FT 

MI SS 10 NS 

R EC OM MS HD ED" 

AN TI AI RC RA ET 

TW EN TY 

HE AV YL OS SE S 

RE MA IN 

R EQ UE ST ED 

R EC OM ME ND AT 10 N 

R EQ UI RE ME NT 


C OM MU NI CA TI ON 

M IS SI NG 


R EC ON NO IT ER IN G 


Three letters 


> 

1 

> 

I 

> 

1 

A- A-A- 

-B -B -B 

N AV AL BA SE 

RE QU ES TE|D 

B OM BA RD ME NT 

R EQ UI SI TI ON 


EL EM EN TS 



EN GA GE ME NT 


Four letters 


AB A- -B 

A- AB -B 

A- -B AB 

H EA DQ UA RT ER S 

AD DI TI ON AL 

M OR NI NG 

EL EV EN 


P OS TP ON E 


A- AB-B , 


AB -B A- 

SO UT HN ES T 

A- -B -B — A- 

CA NC EL 


RE CO HN 01 TE R 

RE CO NN AI SS AN CE 

* 

1 

< 

l 

< 



W IT HD RA WA p 

A- -B — AB 

AB -B — A- 


32! TE RD IC T 

AD VA NC ED 

A- A-A- A- 


EII EM YT AN KS 

CO 121 All DI I7G 

A- -B -- A- -B 



S AT IS FA CT OR Y 

AB — A- -B 

A- A-B -B 


SI GH TI NC- 

RE QU IR EM T 

A-A- C- C- 



| DI SP AT CH ES 

RESTRICTED- 
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A-C- A- C- 

Four letters (cont.) 

-B A- -- AB 

-B -D -D -B 

RO AD JU NC TI OR 

-B AB A- 

U RS UC CE SS FU L 

i 

-B A-A- -B 

AI RS UP PO RT 

-B -D-D -B 

DI SP OS EE 10 N 

P OS IT 10 N 

PR ES EN T 

RE PR ES Eg % 

-B A- AB 

m DI ,UM BO I© ER 

1 

-B A- -- -B A- 
VI SI BI LI TY 

-B A.AB 

IN ST RU CT 10 N 

C ON ST HU CT 10 N 

-B — A- AB 

F IG HD ER PL AN ES 

-B — A-AB 

E ST AB LI SH ME NT 

-B-B A- A- 

EN CO UN TE RE D 

-B-B -D -D 

RE IN FO RC Eli EN T 

RE PE AT ED 

-B A- A- -B 

IH FO R!i AT 10 H 

-B A-A- -B 

DE ST RO YE R 

-B A- -B — A- 

IN ST AL LA TI OR 

-B -D -B-D 

UR ID EE TI FI ED 

CR OS SR OA DS 


/ 

Five letters 


A- -B AB — -B 

-B A- A-AB 

9 

t 

1 

NA VA LA TT AC K 

DI ST RI BU TI OR 

IN ST RU CT 10 NS 

A- -B-B AB 

-B A- -B AB 


R EC OR NA IS SA NC E 

RE PL AC EM EN T 



Six letters 


AB CB C- A- 

A- A- -B AB A- 

A-CB A-CB 

P OS IT 10 NS 

AB -D -D AB 

RE QU IS IT 10 N 

A- CB — A- CB 

ID EN TI FI CA TI ON 

-B AE AD -D 

C ON DI TI ON 

RA DI OG RA M 

Q UA RT ER MA ST ER 

A- CB — CB A- 
SC HO OL HO US E 

A DM IN IS TR AT IV E 


Seven letters 


-B AS-B -D AD 

BE EN FO RC EM EN T 


Eight letters 


AB -B AD-B AD 

AB -B C- AB CB 

AB -D C- AD C- -B 

QU AR TE EM AS TE R 

EM PL AC EM EN T 

IN TE KD IC TI ON 
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APPENDIX 4 


. , - , SERVICE TERMINOLOGY & STEREOTYPES ,, ‘ ’, 

r - ^ ^ 

Familiarity with the style and peculiar phraseology which exist In 

military messages greatly facilitates the cryptanalytic recovery of the 
plain text of any such messages which have been encrypted. Thus* this 
appendix has been compiled to comprise notes on those idiosyncrasies 
present in military messages which are of particular interest end aid 
to the cryptanalyst. The notes which are applicable to the messages 
of all Services are grouped together in Section A, those which are 
applicable to messages of ground, naval, and air origin, respectively, 
constitute Sections B, C, and Dj those which apply to special types of 
messages, such as weather messages, are contained in Section Ej and 
remarks on stereotypic beginnings and endings of messages comprise 
Section F. 


Although the notes contained herein derive primarily from U. S. 
military communications, many apply as well to the military communi¬ 
cations of other countries. At the very least, this appendix Indicates 
the types of information on message style and phraseology which, when 
known concerning the messages of any source, can be quite helpful in 
the cryptanalysis of such messages. 


r i 


- -i* 
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A* REMARKS APPLICABLE TCf^lSAfiS© fa^ll&B^gjlCFS 


1* When mention Is made of time In military messages. It Is? con¬ 
ventionally specified In terms of the 2U -hour clock system (ending at 
midnight). In which time Is expressed as a group of four numerals. The 
first two numerals of the group denote the hour end the last two numer¬ 
als, the minute after the hour; for example, 6:00 AM appears as 0600 
and 6:00 PM appears as 1800. For any current month, the day may he Indi¬ 
cated by prefixing the four-digit time group with a two-digit date group, 
indicating the day of the month; for example, 080600 denotes 6:00 AM on 
thfe 6 th day of the month. In some Instances, a four-digit time group or 
elx-dlglt date-time group occurring in a message may he found with a 
literal suffix, giving rise to such groups as 1800Z, 080600 Q, etc; this 
suffix may he any one of the letters A to X or K to Z end is a type of 
designator used In communications practices to refer to a particular one 
of the 24 time zones of the earth. 


2. Administrative messages In general often contain many sequences 
of numbers, brought about by numexous references to previous messages and 
to various Service regulations (among other items), reference generally 
being made on the basis of identifying serial numbers and dates which 
such Items usually bear; specific illustrations of this fact appear In 
several of the succeeding paragraphs in this appendix. Furthermore, 
administrative messages contain references to items having literal desig¬ 
nations; to minimize errors In this connection such designations arc 
often spelled out phonetically, by means of a" phonetic alphabet, such as 
one' of the following: 


- ABLE 

JIG 

SUGAR 

ALFA 

JULIETT 

SIERRA 


KING 

TARE 

KILO 

TANGO 

1 BAKER 

BRAVO 

LOVE 

UNCLE 

COCA 

LIMA 

CHARLIE 

UNION 

DOG 

MIKE 

VICTOR 

DELTA 

METRO 

VICTOR 

EASY 

NAN 

WILLIAM 

ECHO 

NECTAR 

TJHISKEY 

" FOX 

OBOE 

XRAY 

FOXTROT 

OSCAR 

EXTRA 

GEORGE 

PETER 

YOKE 

GOLF 

PAPA 

YANKEE 

HOW 

ITEM 

QUEEN 

ROGER 

ZEBRA 

I?0TEL 

INDIA 

QUEBEC 

ROMEO 

ZULU 


\ 

MW 

■*! ** 

- ■v 


3 , The messages of all Services exhibit a high content of abbrevi¬ 
ations; for this reason, the following list of frequently-encountered 
abbreviations is included: 


WAVY OFFICER RAMS 

FARM....fleet admiral 

ADM.admiral 

VADM....vice admiral 

RADM....rear admiral 

COI10. . • • commodore 
CAPT • ...captain 
CDR.....commander 

LCDR....lt. commander 

LT......lieutenant 

LTJG....lieut. Jr. grade 

EHS.ensign 


ARMY. AIR OFFICER RAMS 

GEN......general 

LTGEN....11. general 

MAJGEN.. .major general 
BRIGGEN..brigadier general 
COL......colonel 

LTCOL....It. colonel 

MAJ. • • • • .major 
CAPT.....captain 

1ST LT...first lieutenant 
2ND LT...second licut-i rnt 
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PUNCTUATION 


_MISCELLANEOUS 


CLN...colon 
GIIA-. .comma 
PARA..paragraph 
PAREN.parenthesis 
PD....perrod 
RPT...repeat 


CG»...commanding general 

CO....commanding officer 

COM. •.commander 

COMDT.commandant 

DET. • .detachment 

ETA...estimated time of arrival 

ETD...estimated time of departure 

GMT...Greenwich mean time 


HQ. • .headquarters 
LAT. .latitude 
LONG•longitude 
LTR..letter 
MSG..meooage 
NR...number 
RE...reference 
UR...your 


> 


it. The identity of the person originating a military message may 
appear as a signature at the £hd of a giesongo and the^ addressee's Identity 
may appear at the beginning; or either, or both, of these may be "burled 11 
in the middle of the message, set off by parentheses* If the signature 
is at the end of the messagc,_lt may be preceded by STOP (or PERIOD or 
SIGNED, or bolb. The identification of the originator or addressee may 
consist merely of his command designation (e.g., COMMANDING GENERAL, 

FIRST ARMY) or it may consist of his name and rank, followed by COMMANDING 
or some other appropriate amplifying date^ (e.g., in the Army, his branch 
of service). . w - = v * , * 


Examples: 

JONES, COLOIfEL, ARTILLERY 


A £ t~ "* «O l *. **■— ^ . * ,l 

* t 

r t V " f ^'V, V W- ► 


* 


COMMANDING OFFICER, THIRD REGIMENT 
COMMANDER, DESTROYER SQUADRON SIX ’ 


SMITH, FLIGHT I.EADER, SECOND SQUADRON 


5. In military communications, long messages are often broken into 
parts, each part subsequently being treated as a separate message. Thus, 
messages arise which begin "PART (number) OF (number) PARTS", or "(number) 
PART MESSAGE PART (number)", often separated from the following message 
text by STOP or simply by an "X". 


B. REMARKS ON ITEMS APPEARING IN GROUND (ARMY) MESSAGES 

I 

1. When mention of an array unit appears in a military message, its 
size (echelon) is indicated, generally preceded by a numerical or literal 
designation and, as further information concerning the unit, its branch 
of service may be included. The several echelons of the U. S. Army, 
listed in descending order of size, are: army, corps, division (DIV), 
brigade, regiment (HEGT), battalion (BN), company 3 - (CO), platoon. Some 
of the branches of service which may appear, as mentioned above, are: 
Infantry (INF), Artillery (ARTY), Signal Corps (SIG C), Armor, Ordnance 
(ORD), Eagineers (ENG), Quartermaster (QM). 

Examples of unit designations: 

(a) "A" Company, 39th Infantry Regiment, 9 th Infantry Division 


1 An artillery unit at this echelon is termed a battery . 

*-3 
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(b) 1st Armored Division 

(c) 57th Signal Battalion 

2. In connection with 1, above, an army Is the normal coaronod or a 
gencial (four stars); a corps being the command of a lieutenant genes.-»1, 
a division, the command of a major general; ond a brigade, the comn.-'»& of 
a brigadier general# A regiment is normally commanded by a colonel, a 
battalion may be commanded by a'lieutenant colonel or a major; a company, 
by a captain, and a platoon, by a lieutenant. 

3* For reference purposes, when giving locations of units, readily- 
recognizable landmarks such as hills, crossroads hnd road Junctions may 
he referred to in terms of their altitude above sea level (in number of 
foot), if such landmarks do not bohr proper names which are suitable fex 
the purpose. Thus, a reference. In a military message, to CROSSROADS 
SIX FIVE ZERO would apply to that particular crossroads within r pre¬ 
selected area which Is located at on hltltude of 65 b feet# If, within 
any preselected area of reference, there are two or more landmarks of any 
given type which are both at the camp ^altityde, It is necessary to affix 
a distinctive letter or number to the-altitude designation of each. In 
order to clearly identify a particular one, thus, such a reference as 
CROSSROADS SIX FIVE ZERO DASH /hyphen/ B may he encountered# In this 
connection, CROSSROADS may he found abbreviated as "CR", and RQADJURCTION 
as "RJ". . . 

t*. 

4, The location of any particular unit may he specified in terms of 
itB location with respect to a particular place or locality, or to a 
particular landmark. However, its location may also be specified by 
Gtating how it is located on a specific map or portion of a imp* This 
gives rise in military messages to phrases such as COORDINATES ODE FIVE 
POINT TWO FOUR BASH ONE NINE FOUR POINT SEVEN, wherein the numbers before 
the "dash” indicate the unit*B location with respect to the horizontal 
grid lines of a preselected map and the numbers after the n dash fl indicate 
its location with respect to the vertical grid lines. 

5* Specific highways, turnpikes, and other roadways arc often 
identified in military messages by stating the place names of their termi¬ 
nal points; thus the highway running between Grizurbeto and Bolzano could 
be called the GRIZURBETO DASH /hyphen/ BOLZANO HIGHWAY. Similarly, when 
reference is made to an imaginary straight line across the terrain in a 
poi fcicular area, such a line may he identified by specifying any recog¬ 
nizable landmarks between which the line runs, for example, LUTE CROSSROADS 
'HIRES ONE FIVE DASH ROAD JUNCTION TWO NINE EIGHT. „ , , 

6. Included below is a brief list of frequent words appearing in 
3ov-echelon ground traffic, the abbreviation for certain ones are appended 
in prrentheses after them. In addition to the wordG listed below, numbers 
ond rcnks/titlas will be found to have a high frequency of occurrence. 

ACROSS ADVANCE AIRPLANE 

ACTIVITY ADVISE AK^ITION'(At" O) 

ADDITIONAL AFTERNOON AREA 
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ARMORED 

„ HILL 

RADIO 

ARMY 

HOSTILE 

RAILROAD 

ARRIVE _ 

IDENTIFICATION (IDENT) 

READY 

ARTILLERY (ARTY) 

IMMEDIATELY 

RECEIVE 

ATTACK 

BAREAC-E 

INFANTRY (INF) 
INFORMATION (INFO) 

RECONNAISSANCE (RCN) 
REFERENCE (RE) (REF) 

BATTALION (BN) 

LARGE 

REGIMENT (REGT) 

BRIDGE 

LEFT 

REINFORCEMENTS 

CAPTURE 

LIGtfT 

REPORTS 

CASUALTIES 

LINE 

REQUEST 

COMMA 

LOCATION 

REQUIRE 

COMMAND POST (CP) 

MACHINEGUN (MG) 

REQUISITION 

COMMUNICATION (COMM) 

MESSAGE (MSG) 

RESERVES 

COMPANY (CO) 

MORNING 

RIGHT 

CONCENTRATION 

MORTAR 

RIVER 

COUNTERATTACK 

MOUNTAIN 

ROAD 

CROSSROADS (CR) 

MOVE 

ROAD JUNCTION (Rjr) 

DAILY 

MOVEMENT 

ROCKET 

DASH 

NEAR 

SEND 

DEFEND 

, NEUTRALIZE 

SMALL 

DEFENSIVE 

TOHT 

SOUTH 

DISPOSITION 

NORTH 

STOP 

DIVISION (DIV) 

NOTHING 

SUPPLY 

EAST 

-OBJECTIVE _ 

SUPPORT 

EMPLACEMENTS 

OFFENSIVE 

TANKS _ . 

ENEMY 

C^FICER 

TODAY 

ENLISTED PERSONNEL 

ORDER 

TOMORROW 

FIRE 

r PATROL 

TONIGHT 

FLANK 

PLANE , 

TROOPS 

FORCE 

POSITION 

VICINITY 

FROM (FM) 

PREPARE 

WEST 

HEADQUARTERS (HQ) 

PRISONER 

WOODS 

HEAVY 

PROCEED 

YESTERDAY 


C. REMARKS GN ITEMS APPEARING IN NAVAL MESSAGES 

1. Mention of various sized groupings of vessels are found in 
messages of naval origin, among which those mentioned below are quite 
frequently encountered, A major naval force is called a fleet, and the 
levels of echelonmenb (or subdivision) within a fleet are the task force, 
task group, and task unit (in descending order of size). The basic unit 
of all fleet vessels is termed a division, and is comprised of two or 
more vessels of the "same type;* in this connection, when mention is made 
of a division in a naval message, the particular type of vessel 6f whlcli _ 
the division is made up is often specified; for example, CRUISER DIVISION* 
A squadron is made up of two or more divisions of submarines, destroyers, 
landing ships or other light vessels, and a flotilla comprises two or 
more such squadrons. 
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if 

2. In connection with 1, above, a fleet la normally commanded by cn 
admiral (four stars), a tack force being the command of a vice admiral; 
and a task group, the command of a rear admiral* Furthermore, In time of 
war the officer In command of a convoy or flotilla often holds the rank of 
coramodorc; the officer commanding an Individual chip may range In rank 
from captain on down, depending on the type of ship. 

3* A list of the main combat vessels is Included below, the approx¬ 
imate maximum speed of each, which is expressed in KNOTS, is shown in 
parentheses- 1 


BATTLESHIP 

CARRIER 

CARRIER ESCORT 
CRUISER 
DESTROYER 
DESTROYER ESCORT 
SUBMARINE 


(35) 

(35) 

(15) 

(30) 

(35) 

(25) 

(20, on surface; 
10, submerged) 


When a particular vessel is mentioned in a naval message, it may be identi¬ 
fied by a numerical designation, by a group of letters, or by some proper 
name. 


4. In naval messages, the direction of an object from a ship, or the 
course of a particular naval vessel or unit at sea lo given as a compass 
bearing expressed in degrees (from 0 to 359)# for example, BEARING ZERO 
EIGHT FIVE. In some instances the statement of a bearing will be followed 
by the word TRUE or MAGNETIC, indicating that the bearing is measured from 
tho geographical pole (true north) or the magnetic pole (not corrected for 
variation), respectively. 

3* The position of a particular naval vessel or unit at sea may be 
specif led In a naval message by stating Its latitude and longitude in 
degreos and minutes. The latitude may be from 0 to 90 degrees and the 
longitude from 0 to 180 degrees, a specified latitude is generally followed 
by NORTH or SOUTH (as appropriate) and, similarly, longitude is followed by 
EAST or UEST. For example, LATITUDE THREE ZERO DEGREES TOO ONE MINUTES 
NORTH LONGITUDE ONE FOUR TOO DEGREES ONE SIX MINUTES WEST, or more briefly 
LATITUDE THREE ZERO DASH TOO ONE NORTH LONGITUDE ONE FOUR. TOO DASH ONE SIX 
WEST. If.position is stated in conjunction with a bearing, it is not 
necessary*t"o'state both latitude and longitude, and the location, NORTH or 
SOUTH, with respect to the equator or EAST or TOST with respect to Green¬ 
wich Meridian may be' found omitted where no ambiguity arises. Positions 



\ > 1 «* 
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6. The following words may be expected to appear frequently In 
a selection of naval messages of various typesj 


AIRCRAFT 

EXECUTE 

RADAR .. 

ALTITUDE 

FLEET 

REJOIN " 

BEACH 

' ■*- T FLIGHT " 

RENDEZVOUS 

BLOCKADE 

, FORMATION 

SAiL 

BOMBED 

FUEL 

A SEA 

CARGO 

GUARDING 

SHIFT 

CHANNEL 

HARBOR 

.. .. - s SHIP , 

COASTAL 

KNOTS 

SORTT 

COttlAND 

LATITUDE 

SQUADRON 

CONTACT 

LONGITUDE 

w SyARJ30ARD 

COiTVQX 

MILES 

STRAFED 

CORRECTED 

MINE (FIELD) 

STRAIT 

COURSE 

MISSION 

TARGET 

CRAFT 

NAVAL 

VESSELS 

DEPART 

NAVT 

VIA * 

DEPLOT 

OPERATIONS 

VOXAGE, 

EMBARK 

PILOT 

WEATHEp r ~’ 

ESCORT 

PORT 



D. REMARKS ON ITEMS APPEARING IN AIR MESSAGES 


1. The various elements of which an air force is composed and which 
may be mentioned.in air messages are given below. The smallest'grouping 
of aircraft, composed of one or more aircraft of a particular type, is 
called a flight « A squadron is two or more flights of the same type; a 
group Is made up of two or more squadrons, a wing comprises two or more 
groups, an air division is composed of two or more wings, and two or ntore 
divisions constitute an a?r force. 

mm * ■> m i i mm > __ * X 

2. In connection with 1, above, a flight is usually commanded by a“ 
major, a squadron being the command of a lieut. colonel,^ a group^ being 
the command of a colonel, a "wing, jfche commancT of a brigadier* general;" 
and an air force, the command of a major general. 

3. Some of the types of aircraft which_ may be mentioned frequently 
in air messages are listed below, an Indication of the range of speed of 
each type, expressed in KNOTS, is shown in parentheses. 


BCIBER 
CARGO PLANE 
FIGHTER 
JET BCCJBSR 
JET FIGHTER 
LIAISON PLANE 


(250-400) 
(150-350) 
( 250 - 500 ) 
(350-600) 
(250-500) 
( 65-150) 




4. The position of a particular aircraft may be specified in an 
air message by stating its latitude and longitude in degrees and minutes, 
sometimes in conjunction with its altitude in feet. (Latitude may be 
from 0 to 90 degrees and longitude from 0 to 180 degrees.) 
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1. Weather meosages . Any generalization on the specific elements 
which a weather message will contain would perforce be rather tenuous, 
the contents of a particular weather message generally being dependent 
on its source and purpose* However, there are certain elements which 
may be expected.' to appear in most weather messages, these are listed 
below with an indication of the terms in which each is generally expres¬ 
sed: 

-. * \ 

a. Identification of the originating station (by code number, 

or location). ' T 

b. ‘ Wind speed (knots) and direction (tens of degrees, from 

00 - 36 ). 

c. Amount of r low plouds (tenths of sky covered) and their 
height (hundreds of,feet). 

d. Types Qf low, medium, and high clouds (e.g., cumulus, 
stratus, cirrus, etc.). 

e. Temperature of the_afr apd temperature of the dew point 

(both in degrees Fahrenheit). 

* 

"f.*~Present and past weather (e.g., clear, partly cloudy, 
cloudy or overcast, fog, drizzle, rain, snow, showers, thunderstorm, 
etc.). 

a* -I 

g. Horizontal visibility (miles). 

h. Atmospheric pressure (tens, units, and tenths digits in 
millibars) and barometric tendency (e.g., falling, steady, rising, etc.) 

2. Air-to-ground position reports . Position reports made by air¬ 
craft in flight may be expected to contain the majority of the following 
elements of information: 

a. Position of the aircraft (in latitude and longitude or with 
respect to some locality on the ground). 

b. Time. 

c. Speed. 

d. Altitude. 

e. Weather conditions. 

f1 Estimated time of arrival at next reporting point or at 
destination. , , ^ „ 
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P. STEREOTYPIC BEGINNINGS AND ENDINGS 


Within the confines of the comparatively limited scope of military 
messages, stereotypy of phraseology Is inevitable* Particularly In the 
beginnings of messages is this limitation apparent} thus these positions 
lend themselves most readily to attach by the cryptanalyst. The follow¬ 
ing list of stereotypes have a high frequency of positional occurrence, 
and therefore may provide a fruitful source for cribs* It Is to be noted 
that a stereotypic initial word often may suggest a whole opening phrase* 
For example. If a message of low-echelon ground origin begins with the 
word HEAVY, then it Is not too unlikely that the opening phrase Is "HEAVY 
ARTILLERY (FIRE, BARRAGE) (FALLING, INTERDICTING)...*," which might be 
expanded Into "HEAVY ARTILLERY FIRE FALLING ON OUR POSITIONS (NORTH, EAST, 
SOUTH, WEST) OF...." 


BEGINNINGS 


ACKNOWLEDGE 

ADVANCE 

ADVISE (THIS COMMAND) 

(THIS HEADQUARTERS) 

ARRIVE 

ATTACK 

ATTENTION 

CANCEL 

CITE 

CO:25ANDING (GENERAL) 

CC. 2 IUNICATION (OFFICER) 

CONCENTRATE (-ION OF) 

CONFIRM 

CONTINUE 

DEPART (-URE) 

DISCONTINUE 

EFFECTIVE 

ENEMY 

EQUIPMENT 

EXPEDITE (SHIPMENT) 
FOLLOWING (ARE) (IS) 

FOR 

FROM 

HEAVY 

HOSTILE 

INFORM (-ATION) 

IN REPLY (TO YOUR) (MESSAGE) 
LOCATION (OF) 


NUMBERS (1, 1st, 2, 2d, etc.) 

ORDERS 

OUR 

PARAPHRASE 

PLEASE 

POSITION 

PREPARE (TO) (-ATIONS FOR) 

PROCEED 
• RECEIPT 
RECEIVE 

RECOMMEND (-ATION) (-ED) 

REFER (-RING) (TO) (YOUR) 

REFERENCE (YOUR, MY) (MESSAGE, RADIO¬ 
GRAM, TELEGRAM) (NUMBER) (DATED, OF) 
REPEAT 
REPORT 
REQUEST 
REQUIRE 
RERAD 
REURAD 
SEND 

SITUATION REPORT 
STATUS REPORT 
SUPPLY 
VERIFY 

YOUR (COMMAND) (ORGANIZATION) 


ENDINGS 


ACKNOWLEDGE 
ADVISE (IMMEDIATELY) 

CONFIRM 

END 

END OF MESSAGE 
IMMEDIATELY 

NUMBERS (l, 1st, 2, 2d, etc.) 


PERIOD 
REPLY 
REFERENCE 
REQUESTED 
SIGNED (NAME) 

STOP 

TITLES (MAJ, COL, etc.) 
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APPENDIX 5 

LETTER FREQUENCY DATA - FOREIGN LANGUAGES ^ 

. ^ 1 J - 5 -w _ J 

- - * i r > t 

* 1 i *" f - r 

"■ I (l M ' 


Section Pages 

A* German letter frequency data..».»..» 3*6 

B. French letter frequency data.. »••••••••••••• 7->10 

C. Italian letter frequency data........ 11-14 

D. Spanish letter frequency data••••..... 15-18 

E. Portuguese letter frequency data. 19-22 

F. Russian letter frequency data... 23-26 
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flhe letter frequency data contained herein has been compiled from 
selected newspaper and magazine articles comprising war communiques and 
other military-type text* In the material which was processed there 
were place names and words foreign to each particular language, these 
words account for the presence pf certain non-characteristic letters In 
the data given herein for those languages which make use of the Roman 
alphabet* 
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A. GERMAN LETTER FREQUENCY DATA 


1-a. Absolute frequencies of single letters of OeMW plain texb, arranged 
alpho.beblcally, based onT^OjO^b lotters"of text * (The ietiera X and Y ‘are ' 
derived from foreign words contained in German plain text). 


A 

3,601 

G 

1,921 
2,47 J 

L 

1,988 

Q 

6 

V 

523 

B 

1,023 

H 

M 

1,360 

R 

4,339 

w 

099 

C 

1,620 

I 

4,879 

N 

6,336 

S 

4,127 

X 

12 

D 

3,248 

J 

192 

0 

1,635 

T 

3,447 

Y 

24 

E 

10,778 

K ‘ 

747 

P 

499 

U 

2,753 

Z 

654 

F 

958 
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l-b. Monographic kappa plain, German languag e ■ .0787 


l-c. Frequency distribution of single letters baaed on 60,046 letters of 
German plain text, reduced to ],000 letters, arranged according to their 
relative frequencies. T 


E 

180 

T 

57 

G 

32 

F 

16 

N 

106 

D 

54 

0 

27 

W 

15 

I 

81 

U 

46 

C 

27 

K 

13 

R 

72 

H 

4i 

M 

23 

Z 

11 

S 

69 

L 

33 

B 

17 

V 

9 

A 

60 




' 




8 

3 


__ _ . 1,000 

1-d. Percentage of vowels, high-frequency consonants, medium frequency con ¬ 
sonants, and low-frequency consonants in 66,046 letters of German plain text . 
Percentage of 8 most frequent letters in German plain text. 

' r.r™rr.n _ m 1 - * , 1 ''i " I n .1 ... 1 1.1™ t 1 r n l 

Vowels A,E, r l,0’u,' rnd Y = 39M 
High-Frequency Consonants D,N,£,S, and T ■ 35*8$ 

MediunwFrequedcy Consonants 3$,C,F,G,H,L,M, and W * 20.4$ 

Low-Frequency Consonants J,K,P,Q,V,X, and Z « 4.4$ 

- *■ i v * * - * - **- - —— — , ■ . 

I t i | f i 1 

•r (In descending order Of frequency) 

1 V m / *“ -*X > f 

- * 8 most frequent letters E,b,T,R,S, A,T, and D = 67.9$ 

1 » ' 

^ ^ * 

l-e. Absolute frequencies of single letters as initial letters of 9.568 
words in German plain text, arranged according to their frequencies . 


D 

1,716 

U 

550 

Z 

343 

K 

263 

A 

762 

N 

544 

M 

339 

P 

181 

S 

698 

G 

46l 

N 

306 

R 

167 

E 

686 

B 

46o 

F 

280 

L 

158 

I 

581 

V 

4o8 

H 

265 

J 

135 
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2 -a. Frequency distribution of dlrrapho baaed on 60,046 let lex - a of 
German plain text, reduced to 5,000 digraphs * ~ 

ABCDEFGH1JKLMM0PQRSTUVWXYZ 











£ 1 
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raphlc kappa plain., German language « .0111 


2~c. The 95 digraph s cotrc prlpi 


—Zm nM . 

l62SE9HESSSS 


of German plain text, based on the 


according to their relative 





EN 

235 

WE 

58 

IE 

911 

EI 

90' ES 

68 j se 

461 

ER 

195 

RE 

60 

IN 

84 

NI 

35 IS 

53! SI 

30 j 

DE 

127 

ED 

31 

GE 

78! 

EG 

32 




2-e. Frequent digraphs in German plain text vhoso reversals are rare or in 


• , - * 

r " "'*■ ~V \ —T-* 

CH 130 j HC 0 j KD 113 ! DN 2 | NO 57 { GN 3 | SC 55 j CS 0 


2-f. Doublets occurring in German plain text,, arranged according to their 


frequencies from the table 


ss 

40 

t 

EE 

13 

FF 

7 

RR 

61 

gg’ 

\ 

2 1 

\ 2 1 

LL 

20 

MM 

11 

TT 

7 

AA 

4 1 

11 

2 1 

HH , j 4 j 

NN 

18 

DD 

8 







-i 


2-g. The 22 digraphs appearing 100 or more times as ^beginnings of words ip 
9>568 words in German plain text, arranged according to their-absolute fgq.- 


DE 

§05 

EI 

300 

DA 

*244 

WE 

192 

ER 

153 

'w 

’1*24 

ST 

DI 

567 

GE 

299 

VO 

214 

VE 

172 

HA 

l4o 

mx 

117 

IN 

UN 

428 

BE 

252 

SI 

197 

HI 

155 

AL 

134 

SN 

112 

SE 

AU 

318 










* 

- 


T— ’ * t ♦ 3 # „ 

The 10 digraphs above this line compose 25/6 of German plain textY 

2 The 37 digraphs above this line compose 50 56 of German plain text. 









































SCH 

666 

ERE 

313 

HEN 

198 

AUS 

162 

1ST 

142 

HRE 

124 

DER 

602 

EN3 

270 

SSE 

191 

T1S 

159 

STA 

l4l 

UER 

122 

CUE 

599 

CRT 

26-4 

KSI 

190 

DER 

157 

DES 

l4o 

ACH 

119 

DIE 

56*1 

NGE 

263 

TER 

188 

ENI 

157 

FUE 

139 

GES 

118 

NDE 

541 

NDI 

259 

REN 

185 

ENG 

155 

LTD 

139 

ABE 

117 

EIN 

519 

IND 

254 

EIT 

184 

ION 

154 

UER 

138 

ERA 

117 

END 

481 

ERD 

248 

EBE 

178 

SEN 

152 

ERU 

137 

BFN 

116 

DEN 

457 

INE 

-247 

FI® 

175 

ITI 

151 

TON 

136 

I JEN 

115 

ICH 

453 

AND 

246 

LIC 

175 

AUF 

149 

SEX 

133 

RIE 

112 

TEN 

425 

RDE 

239 

EGE 

173 

IES 

149 

ESE 

132 

VER 

110 

UNO 

377 

ENA 

214 

DAS 

172 

ASS 

148 

ERT 

128 

LAN 

109 

HEN 

332 

ERS 

212 

ENG 

171 

EOT 

148 

NDA 

127 

EBB 

108 

UKD 

331 

EDE 

209 

HUN 

169 

ENT 

146 

IED 

126 

ESS 

108 

GEN 

321 

STE 

205 

NCR 

166 

ERI 

143 

ERIT 

125 

LLE 

108 

ISC 

317 

VER 

204 

RUN 

163 

EST 

142 






VAU 108 

tsc 107 

EIW DO6 
ERG 106 
RIT 106 
EHR 105 
CEA 104 
VON 104 
SIC 103 

XOE \Q2 
ITE 101 
ENZ 100 
EKB 100 
EOT 100 


3-b* The 2 5 trig r aphs appearing 50 or mere t lines as beginnings of vords 

In 9 


frequencies. 


assssm 


Eaaal 


DAS 79 

SCH 73 

AUF 64 

DEU 6l 

UKT 57 

BRI 79 

AUS 69 

NER 63 

GES 60 

GRO 56 

DIE 76 
NIC 73 

SEI 68 
STA 65 

IDD 62 
ALL 61 

GEG 59 

AUC 55 


4. The 121 totr a qraphs appearing $0 or wore limes in 60,046 letters of 
German plain text, arranged according to their absolute frequencies 


SCI® 398 TSCH 107 ENIN 80 RITI 66 NERD 6l DIEB 54 

ISCH 317 NUND 106 NICH 80 ATIO 65 RSCH 60 ENZU 54 

CKEN 296 HIS 104 UNGD 80 GENO 65 EDEN 59 XTEN 54 

NDER 243 SIGH 103 EITE 79 TEND 65 ERGE 59 KRIE 54 

EIRE 218 RUNG 101 DEUT 78 EEER 64 ESSE 59 RIEG 54 

ENDE 216 ANDE 100 FUER 78 GEGE 64 UKTE 59 SDIE 54 

NDIE 176 UNGE 100 CffiEE 77 FOLI 64 EICH 58 URCH 53 

LICH 168 EREI 94 EGEN 7 6 SIND 64 TLIC 58 ALLE 52 

ICHT 151 TION 93 HEIN 7^ TUNG 64 INER 57 DERS 52 

TISC 146 SEIN 92 IESE 75 FKSI 62 EBEN 56 EKWE 52 

ERDE l44 IEDE 91 ERST 74 FUTS 62 ENDA 56 HABE 52 

EIIDI l4l LAND 91 RDIE jk LITI 62 ENST 56 OIIEN 52 

NDEN 136 SSEN 90 ERDI 72 UEBE 62 IGEN 56 SCHI 52 

RDEN 133 BRIT 89 STLN 72 UTSC 62 ONDE 56 DEED 51 

ENUN 120 DASS 86 CHER 71 AUCH 6l TENS 56 DISC 51 

ICEE 120 HTER 86 INDI 71 DEES 6l EDIE 55 ENEN 51 

IHDE 111 EDER 83 REIN 71 EIND 6l ERTE 55 1IACH 51 

NGEN 110 EREN 83 DERE 70 O^IT 6l HREN 55 NBAS 51 

ERUN 109 EKGE 8l NGDE 70 SOHA 6l TDIE 55 TJEGS 51 

DIES 108 ENAU 80 ENBE 68 SCHL 6l ATEN 5 1 ' AVEN 50 

KBER 50 

*5." A verage length of -words in German plain text = 6.3 

5-6 

- -1 ? * 
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B. FRENCH LETTER FREQUENCY DATA 


1-n. Absolute frequencies of single letters of French plain text, arranged. 


on 55,75# letters of text. 



4,480 

4o6 

1,944 

2,198 

9,334 

646 




6 

624 

1 

L 

’ 2,737 

Q 

616 

4 

V 

801 

H 

276 

M 

1,617 

R 

4,117 

w 

6 

I 

4,230 

N 

4,406 

S 

4,564 

X 

317 

J 

184 

0 

3,255 

T 

4,057 

Y 

100 

K 

25 

P 

1,689 

U 

3,045 

Z 

84 


$0 = .0777 


Frequency distribution of single letters based on 55.758 letters in 
French plain text reduced to 1,000 letters, fn-cl arranged according to their 



167 

T 

73 

t 

c 

35 

G 

* 11 

J 

J r 

3 

s 

82 

0 

58 

p 

30 

Q 

11 

Y 

2 

A 

80 

'U 

55 

M 

29 

B 

7 

Z 

2 

N 

“79 

L 

49 

V 

l4 

X 

6 

K 

1 

I' 

76 

D 

39 

F 

12 

n 

5 

W 

- 

R 

74 







- 

1,000 


1-d. Percentage of vowels, high-frequency consonants, med i um-frequency 50n 
sonants, and low-frequency consonants in 55,758 letters of French plain text , 
Percentage of 8 moBtfrequent letter's In French plain text * 


Vowels A,E,I,6,U, ana Y = 43.8$ 

* " High-Frenqueuey Consonants N,R,S, and T = 30.7$ 

Medium-Frequency Consonants C,S,L,M, and P * 18.3$ 

Lcw-Fr^quenfc^ Consonants B,F / G,H, J,K,Q,V,W,X, and Z = 7*2 

* - * * - ■ * ^ — * : - I / ' Vj, 

(in descending order of frequency) 

ni.i, 1 - j - 

8 most frequent letters* E,S,A,N,I,R,T, and 0 **68.9$ 


1-e. Absolute*frequencies of single letters as initial letters of 10.748 


words in French plain text, arranged according to their frequencies. 


letter words have been omitted). 


D 

1^45 

L 

784 

I 

315 

U 

2k 0 

H 

67 

P 

929 

S 

664 

F 

313 

0 

177 

Z 

7 

E 

894 

Q 

394 

T 

305 

G 

146 

K 

5 

A 

866 

R 

389 

N 

278 

B 

115 

W 

3 

C 

816 

M 

337 

V 

263 

J 

98 

Y 

3 


9,^53 


5-7 
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2 -a. Frequency distribution of digraphs baaed on 55,758 letters of 
French plain text, reduced, to 5,000 digraphs* 



A 

B 

c 

D 

E 

F 

G 

H 

I 

j 

K 

z4 L«tfor 
L M N 

0 

p 

Q 

R 

s 

T 

u 

V 

w 

X 

Y 

z 

A 

2 


20 

11 

4 

6 

II 


30 

i 


36 

12 

68 

1 

21 

3 

41 

□ 

46 


13 

B 


2 

jj 

B 

s 




4 




4 



12 



4 


■ 

5 

2 

1 

2 

■ 

B 

B 

B 

B 

C 

i 


6 


47 



II 

20 

" 1 


3 



48 


B 

4 

1 

8 

8 



B 

B 

B 

D 

18 



1 

101 



1 

20 

1 



1 


IO 

1 


6 

2 


26 

" 




B 

E 

*30 

4 

41 

4-8 

30 

»* 

14 

3 

13 

3 


S6 

38 

I05T 

4 

38 

i 

87 

IS4 

_ 

58 

__ . 

27 

17 


8 


B 

F 

IO 


2 

1 

1 

6 



S 



1 



8 

1 

B 

10 

« 


1 






G 

c. 




16 


1 


Z 



3 

1 

7 

6 

B 

B 



4 

2 


B 

B 



H 

u 




6 


B 


4 



B 



3 



1 



4 


■ 

■ 



I 

i 

3 

12 

10 

41 

4 

D 



1 


5 

* 

4? 

51 

3 

iz 

27 

S? 

47 


7 


7 


1 

J 

B 




6 










5 






B 






K 















1 


B 




B 






* L 

57 


1 

S 


1 


1 

23 



ES 


3 

lo 

B 



5 

B 

12 


B 


1 

B 

3ll 

22 

1 

1 

1 

52 




23 




13 


8 

B 

■ 


B 

B 

4 


B 

B 


B 

Eh 

H 

( 

« 

40 

54 

7 

II 

1 

zo 

D 

B 

B 

B 

10 

17 

6 

B 

3 

B 


4 

7 

B 

B 


B 

0 


5 

1 

3 

1 

1 

z 

1 

21 

D 

B 

B 

q 

IO} 


7 


27 

>3 

B 

52 

2 

B 

B 

2 

B 

p 

30 


1 

1 

D 

" " 

i 


Z 

3 



II 

B 


35 

7 


34 

• 

6 

4 

- 

B 




Q 



i 


















S4 






R 

6-2 

2 

10 

13 

127 

2 

6 


24 

1 


16 

II 

<8 

27 

5 

3 

7 

H 

11 

6 

7 




1 

9 

4-1 

2 

16 

32 

73 

5 

z 

1 

36 

2 


B 

8 

6 

22 

24 

Ki 

8 

□ 

33 

24 

4 



I 


T 

40 

1 

7 

22 

78 

4 

1 

z 

67 

1 


12 

4 

4 

M 

B 

7 

44 


10 

II 

2 





U 

,»Z 

3 

|o 

5 

31 

4 

11 - - 

3 

1 

2+ 

3 


13 

6 

3- 

24 

1 

B 

1 

48 

26 

n 

\ 

8 


13 

1 


V 





24 




16 





T 

16 

B 


3 



2 

_i 





W 













_ 



B 



b 

B 


B 

B 

B 



X 

4 


3 

3 

1 

3 



• 

1 

B 

■ 


B 

B 

B 

B 

B 

B 

B 

B 

B 

B 

B 

B 

B 

B 

Y 

2. 




B 

■ 

H 



E 

B 


E 

B 

0 

B 

B 

B 

n 

B 

B 

B 

B 

B 

B 

B 

Z 




i_ 

B 

i 

B 


T 

i 

B 


m 

s 

B 

B 

B 

B 


B 

B 

B 

B 

B 

B 

B 
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■ * m m 3 . _ 

2-1). Digraphic kappa plain, French language = .0093 

2-c. The $r digraphs comprising 75$ of French plain text, based on the 
tab le of 9,000 digraph3~Tltom 2-a), arranged according to thoir relative 
fx equenoio?. _ ’ 


rES 

'-154 

1,237 3 i 

EC 

49 

I HD 

40 1 EE 

30 

SP 

24 

HI 

20 

EE* 

127 

i?r 

' 58 

III 

ho 

2,5151 

IIC 

29 

SU 

24 

DI 

20 

0W 

‘109 

EM 

58 

ED 

48 

TA 

40 

AU 

29 

RI 

24 

Cl 

20 

*EJS 

109 

LA 

57 

CO 

48 

UE 

39 

IR 

27 

VE 

24 

AC 

20 

Ejsr 

105 

EL 

56 

UR 

48 

EP 

38 

EU 

27 

TS 

23 

UT 

19 

HT 

99 

QU 

54 

CE 

47 

AL 

36 

n. 

27 

MI 

23 

HO 

19 

LE 

95 

HE 

54 

IT 

47 

SI 

36 

RO 

27 

LI 

23 

RT 

19 

ER 

89 

HS 

53 

AT 

46 

PO 

35 

OR 

27 

SO 

22 

HA 

19 

TE 

”78 

ME 

52 

TR 

44 

PR 

34 

DU 

26 

m 

22 

DA 

18 

SE 

75 

IS 

52 

SA 

42 

ST 

33 

LL 

26 

TD 

22 

AS 

17 

AH 

68 

OU 

52 

IE 

4l 

SD 

32 

US 

26 

AP 

21 

EV 

17 

TI 

67 

10 

51 

AR 

lil 

PA 

30 

UN 

26 

01 

21 



RA 

62 

AI 

50 

SS 

4l 

EA 

30 

UI 

24 

OM 

21 

I 3,751 


2-d. Frequent digraphs la French plain tort ylioso reversals a rc also , 
f requent, accompanied by thoir frequencies fron the table of 5,000 digraphs 
Tltca 2-a) . 


ES 154 ! SE 

75 

LE 

95! 

EL 

56 

RA 

62 ] AR 

4l 

IS 

52 ! 
49( 

SI 

36 

RE 127 i ER 

89 

TE 

78 i 

ET 

58 

EM 

58 J ME 

52 

LU 

CE 

47 

DE 109 ! ED 
EH 105 ! HE 

48 

54 

TI 

67! 

XT 

47 

LA 

57 ! AL 

36 

AT 

46 ! 

TA 

40 


2-e. Frequent digraphs In French plain text i-hoae reversals arc rare or 
I nfrequent, accompanied fry their frequencies iron the tabic" of"57030 digrapha 
Tifcon 2-a) . 

HI 99 S TN 4 j QU 54; UQ 1J KS 53 i SN 6 | 0U 52 } U0 1 

^ i 

2-f. Doublets occurring in French plain text, arran ge d according to their 
frequencies from the table of 5,000 digraphs (item 2-a) . 


SS 

4l 

LL 

26 

NN 

10 

PP 

9 

CC 

6 

AA 

2| 

EE 

30 

MM 

,13 

TT 

10 j 

RR 

7 

FF 

6 

DD 

1 1 


1 

1 


2-g. The 22 digraphs appearing 100 or. more times as beginnings of words 
in 10, 748 words in French plain text, arranged according to their absolute 
frequencies . 


DE 

501 

RE 

283 

PO 

222 

CU 

168 

AU 

150 

DI 

124 

CO 

394 

PA 

268 

IN 

178 

CE 

163 

NO 

133 

AL 

122 

QU 

PR 

347 

291 

LE 

240 

SE 

178 

ET 

153 

TR 

127 

UN 

122 


> 

117 

112 

101 


■*" The 13 digraphs 
^ The 39 digraphs 


above this line compose 25$ of French plain te’cfc, 
above this line compose 50$ of French plain text. 
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3-a» The 97 trigraphs appearing XOO or more times in 59,758 letters of 
French plain text, arranged according to their absolute frequencies . 


ENT 588 
ION 555 
TIO 433 
ONS 373 
RES 367 
QUE 338 
DES 313 
EDE 305 
EME 288 
ATI 287 
EES 284 
NEE 281 
TRE 280 
MEN 272 


CON 271 
ERE 267 
ANT 238 
ESE 230 
ELA 227 
LEE 216 
FAR'213 
NDE 211 
SEE 210 
DEL 209 
ERE 206 
OUR 205 
RAN 196 
IRE 191 


EST 188 
ERA 185 
ECO 184 
ESD 179 
OND 175 
EEM 173 
NCE 173 
EEE 172 
ESA 163 
TEE 163 
ITE 162 
SSE 160 
ONT 157 
ARC 153 


ESS 151 
AIT 147 
POU l46 
TER 146 
COM 143 
ESP 139 

ous 139 

AIS 137 
EMA 137 
IER 136 
NTS 135 
TES 135 
EQU 133 
IQU 131 


NSE 130 
REN 127 
SQD 124 
AIR 123 
EPA 120 
QUI 120 
SET 120 
REC 119 
AND 118 
ETA 118 
SEN 118 
PRO 117 
ISE 116 
REP 116 


EUR 115 
NTA 115 
SER 115 
ESO 112 
DEC 110 
EPR no 
ALL 109 
ECE 109 
UHE 108 
RAI 106 
RLE 106 
SSI 106 
EDE 105 
SUR 105 


TRA 105 
ISS 104 
IHT 103 
TEN 103 
UEL 102 
AHS 101 
BLE 101 
QUA 101 
CES 100 
ETE 100 
ETR 100 
OHM 100 
TAT 100 


3-b. The 20 •trlgraphs appearing 50 or more times as Beginnings of words 
In 10,74b vords In French plain text, arranged according to their absolute 
frequencies. 


CON 213 
POU 144 
FEE 135 


COM 129 
PRO 105 
ALL 104 


FRA 93 
PAR 87 
QUA 80 


INT 75 
CEN 72 
NOU 69 


ETA 69 
DAN 68 
RED 65 


SER 61 
TRA 57 
RES 56 


vou 56 

FAI 50 


4* The 82 tetragraphs appearing 50 or more times in 55»758 letters of 
French plain text, arranged according to their absolute frequencies . 


TION 431 

CONS 

98 

LEME 

83 

ERAL 

71 

EHEN 

58 

RESS 

55 

MENT 251 

EPAR 

98 

QUEL 

83 

ERES 

70 

ESSE 

58 

IERE 

53 

ATIO 220 

RESE 

96 

LEMA 

80 

DANS 

67 

NOUS 

58 

IRES 

53 

IONS 208 

ENTE 

95 

PORT 

80 

OUVE 

67 

TRES 

58 

TEDE 

53 

EMEN 200 

LLEM 

93 

ENTS 

78 

EMAN 

66 

ENER 

57 

EQUE 

52 

POUR 136 

FRAN 

91 

EPRE 

77 

SENT 

66 

DDES 

57 

NDEL 

52 

IQUE 128 

PRES 

91 

EDES 

76 

ANDE 

63 

NSEI 

57 

ECOM 

51 

IOND 124 

ENTA 

90 

ESET 

76 

PART 

62 

NTDE 

57 

GENE 

51 

DELA 120 

RANC 

90 

INTE 

76 

SDES 

62 

CAIS 

56 

SEIL 

51 

AIRE 117 

MCE 

89 

ALLE 

75 

ESEN 

61 

ESTI 

56 

ELES 

50 

ONDE 107 

SION 

89 

ANTE 

75 

RAPT 

61 

mo 

55 

ETAT 

50 

ECON 102 

COMM 

88 

MAUD 

75 

ENTD 

60 

NEIffi 

55 

ILLE 

50 

ESDE 102 

ELLE 

84 

CENT 

74 

SSIO 

60 

NERA 

55 1 

SQUE 

50 

ONSE 101 

NEER 

84 

QUES 

72 

ENCE 

59 






5# Average length of vords in French plain tesefc » 5,2 letters . 


5-10 










»C . ITALIAIf IETTER FRE^UEIJCY DATA 


In o21 calculations, accented, letters have "been combined with the 


corresponding unacconbpd letter. , - . „ 


*e - r i 

1-a. Absolute frequencies 3f single letters*of Italian plain text, arranged 


alphabetically, based oh 51 




trfiirl 


A 

6,771 G 

1,168 

L 

3,592 Q 

227 

V 

1,024 

B 

527 H 

493 

M 

l,44l R 

4,037 

w 

13 

C 

2,367 I 

6,568 

H * 

4,094' s 

2,967 

X 

? 

D 

2,258 J 

18 

0 

5,022 T 

4,139 

Y 

l4 

E 

6,784 IC 

28 

P 

1,616 u 

1,547 

Z 

527 

F 

655 








57,906 


1-d. Monographic Kappa plain, Italian language s .0745 

1-c. Frequency distribution of single letters based on 97,906 letters in 
Italian plain text, reduced to 1,000 letters and arranged according to their 
frequencies.- ’ - »> » 


E 

117' 

R 

70 

P 

' 28 

F 

11 

A 

1D7 

L 

62 

u 

27 

B 

9 

I 

113 

S 

51 

M 

25 

Z 

9 

0 

87 

C 

4i 

G 

20 

H 

9 

T 

H 

72 

71 

D 

39 

V 

18 

Q 

4 


1-d. Percentage of vowels 
sonants, and low-frequency 


1,000 

, high-frequency consonants, medium-frequency con- 
consonants in 57,906 letters of Italian plain 


text. Percentage of 6 most frequent letters in Italian plain text . 

Vowels A,E,I,0,U, and Y = 46.1$ 

High-Frequency Consonants L,N,R, and T a 27.4$ 

Medium-Freguency Consonants C,D,G,M,P,S, and V - 22.2$ 

Low-Frequency Consonants B,F,H, J,K,Q,W,X, and Z = 4.3$ 

(Listed in descending order of frequency) 

8 most frequent letters E,A,I,0,T,N,R,L - 70.8$ 

1-e. Absolute frequencies of single letters as initial letters of 10,48l 
words in Italian plain text, arranged according' to~"their frequencies. (One 
letter words have been omitted.) 


D 

1,381 

L 

500 

T 

337 

u 

217 

C 

i,o4i 

R 

403 

G 

333 

Q 

172 

S 

885 

N 

396 

F 

298 

B 

153 

P 

830 

E 

374 

V 

263 

H 

69 

A 

I 

822 

685 

M 

371 

1 

t 

0 

235 

Z 

29 
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2-a. Frequency distribution of digraphs based on 57,$±7 letters of 
Italian plain text, reduced to 5,000 digraphs’! 


2,4 Letter* 

BCDEFGHIJKLMN OPQRSTUV. WX 

1 IsiUlli+liilsil i [ill I 74ktf7s| s |i+[ + In st|«f 6 fatl | I 


Y Z 



sN J J J JJ-LL 









of Italian, plain text, "based on the 


ed according to their relative 


ER 96 
ON 86 


TA 78 
AN 78 


AL 78 
EN 73 


RE 71 
NT 66 


DE 65 
TE 6? 


LA 

62 

NE 

50 

IN 

62 

NO 

50 

1,2604 LB 

49 

RA 

62 

IS 

48 

ES 

61 

IT 

45 

TI 

59 

OL 

45 


EL 65 
DI 64 


AR 571 SI 
10 


44 I NA 


CO 64 
AT 63 



MA 

' 3i 

S3 

31 

DA 

31 

EC 

30 

Re 

30 

|j*3 

* 30 

lE 

36 

RO 

•28 

OD 

. 27 

ET 

27 

VA 

r 26 

I® 

26 

SO 


r ■** 

-*9 .«* 

ftef 

i A 


HE 

25 ( VE 

23 

OP 

25 oc 

22 

AM 

24 AG 

22 

UN 

24 EG 

22 

El 

AV 

24 EP 
24 L0 

22 

21 

OM 

24 IP 

20 

PA 

23 ZI 

20 

DO 

23 SA 

20 

VI 

23 CE 

20 

AP 

23 QG 

20 

PR - 

23 GI 

20 

EA 

23 ! 

* If 

f 7W 


frequent, accompanied by their fre 




r*u i 


► 4* t 

ose reversals are also 


000 digraphs 





ER 961 RB 71 EL 66 

ON 86 NO 50 DE 65 

TA $3 | AT 63 RA 64 

AN 78} NA 32 IN 62 

AL 76 | LA 62 ES 62 

EN 73 ] NE 51 TI 59 

Frequent digraphs in Italian 


infrequent, accompanied 


Item 2-a 


} LB 49[fj: ^6 J IL 35 
'ED 53 OR t 55 J RO 45 
! AR -57 IC 521 Cl 33 
i HI Sf IS M\ SI 45 

i SE 32 AD 4l! DA 31 
| IT k< AC 39l ca 32 

*>*■$<*£*-“t 

plain text ■vmpqe reversals are rare-01 


u ^ 
T« . 


i AR *57 IC 52 

i HI '3l IS M 


es from the, table of 5*000 digraphs 



frequencies from the table of 5*000 digraphs {.item 2-aJ « 


LL 

52 

AA 

18 

II 

11 

BN 

10 

j . -s 

FF .7 

MM 

4 

w 

2 

TT 

37 

EE 

15 

PP 

'll 

GO 

8 

ZZ * “5 

00 

4 

DD 

1 

SS 

31 

RR 

12 

CC 

10 

BB 

7 



- 




‘ ***•*■< . 

2-g. The 26 digraphs appearing 100 or more times as beginnings of words in 
10,48l words in Italian plain text, arranged According to their absolute 


E asm as a 


543 I RE 210 I PR l84'| NO 


172* PA 153 
169 PO l4l 
162 CA 132 


1 ** 'V-**:?*-** 

i^T SE*' 121 fife. *£l2 I RE 108 


SO. JL2l 


DE 505 CH 197 CSC 172 PA 153 ML JL2l UN 111 ES 107 

ST 222 AL 186 NE 169 PO l4l TR 121 SU 109 j TE 103 

DI 235 IN 185 RI 162 CA 132 DA 120 

1 The 18 digraphs above this lice com^r^se^^ of Italian plain text, 

2 The 43 digraphs above this line conrprlse $0i> of Italian plain text, 

* ,-*5-13 

* - * U ^ 


































REF ID:A568^2 

m i»+ * 

3-a. The 90 trlgraph3 appearing 100 or more times la 57,906 letters Of 
Italian plain text, arranged according to tneir absolute frequencies > 


DEL 

348 

STA 

215 

ERE 

169 

ICA 

345 

SSI 

130 

ODI 

ll4 

ESI 

ENT 

348 

ALT 

213 

ZIO 

106 

RAN 

145 

IIEL 

327 

ORI 

114 

COR 

ELL 

314 

EDI 

212 

ATO 

165 

STR 

l4j 

AGO 

125 

RMA 

114 

IAN 

CON 

306 

ALL 

201 

NTI 

165 

ALE 

144 

ATI 

125 

AKB 

113 

TAM 

CEE 

276 

ITA 

190 

ANT 

163 

3DI 

143 

IDE 

123 

ETT 

113 

ATE 

LIA 

274 

MO 

197 

ERA 

163 

COM 

139 

ADI 

121 

ODE 

113 

NON 

ION 

265 

OST 

196 

TRA 

160 

ECO 

137 

AND 

121 

PRE 

112 

VER 

ONE 

247 

ERT 

187 

ESS 

158 

LLE 

137 

TEN 

120 

NDO 

110 

ICA 

PER 

238 

ARC 

186 

AIT 

157 

OUT 

136 

0N0 

119 

ORI 

no 

OLA 

EDE 

228 

TAL 

184 

NTO 

156 

TER 

136 

ARI 

117 

AZI 

109 

STI 

NTE 

227 

LIA 

180 

ADE 

155 

TAT 

134 

NTH 

117 

ENE 

109 

0C0 

ICO 

216 

1ST 

174 

EST 

151 

TTA 

132 

PAR 

ll6 

ELA 

107 

RIA 

MEN 

216 

OLI 

171 

RES 

146 

AIA 

130 

TRO 

llO 

ERO 

107 



107 

106 

106 

105 

104 

103 

103 

101 

101 

101 

100 

100 


3-b, The 19 trigraphs appearing 50 or more tinios as 'beginnings of words 
in lO/frBl rords in Italian plain text, arranged according to their absolute 
frequencies . 


DEL 

217 

STA 

106 

QUA 

81 

PRE 

62 1 

DAL 

5? 

PER 

55 

GRA 

53 

COR 

195 

ALL 

100 

PRO 

75 

REL 

571 

ARC 

56 

RUS 

55 

STO 

51 

COM 

137 

ITA 

94 

QUE 

74 










4. The 57 tetragraphs appearing 50 or more times in 57,906 letters of 
Italian plain text, arranged according to their absolute frequencies . 


DELL 209 

ALIA 

99 

ICON 

74 

AGLI 

66 

LIAR 

59 

MENT 188 

CONT 

93 

VANO 

74 

ICHE 

66 

TORI 

59 

I0HE l60 

ADEL 

92 

ECON 

73 

IDEL 

64 

ALLE 

58 

ELLA 150 

OSTR 

88 

I0RI 

71 

ELLE 

63 

ANDO 

58 

ZION 147 

ERTO 

87 

STAT 

70 

HELL 

63 

DALL 

58 

TALI 125 

AMEN 

83 

STRA 

70 

IMEN 

61 

NTRO 

58 

AZIO 106 

ALLA 

83 

GLIA 

69 

ANTI 

60 

OCHE 

58 

EDEL 106 

ERZA 

75 

ISTA 

68 

ATTA 

60 

ANTE 

57 

ITA£ 106 

ORTR 

75 

ODEL 

68 

PART 

60 

EPER 

57 

EHTE 105 

ENTI 

74 

ACON 

66 






0FER 

RUSS 

TAT0 

TEDE 

0C0N 

SION 

TART 

STOP 

N0ST 


56 

56 

55 

55 

54 

53 

53 

52 

51 


5 t Average length of words in Italian plain text * 5*5 letters. 
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D. SPANISH LETTER FREQUENCY MtA 1 


1-a* Absolute frequencies of single letters of Spanish plain text, arranged 
alphabetically, based on 60,115 letters of text , * ~~~ 

1 — t ^ 

V *602 

W 2 3 6 

X 127 

Y 413 

Z 18? 

- 

, So; 115 

41 ’ * i ^ 4 ! 

1-b. 

1-c. 


A 

6,681 

G 

823 

L 

2,174 

Q 

* 346 

B 

79? 

H 

367 

M 1 

l, 7 *i 0 

R 

4,628 

C 

3,137 

I 

4,920 

IT 

4,823 

S 

4,l4o 

D 

2,687 

J 

190 

0 

" 5,859 

T 

3,180 

E 

F 

7,801 

48l 

K 

22 

P 

* 

,.^785 

U 

* 

2,172 


_C" 

- . *» 1 t * ,"? + * 
Monographic kappa plain, Spanish language" - .0747 


—■ ^ «- r- -*» »r# f <*• , " 

Frequency distribution of single letters'based on 60,115 letters in 
Spanish plain text, reduced to 1,000 letters, and arranged according to 
their frequencies . _ _ 


E 

A 

0 

I 

N 

R 


130 

111 

r97 

82 

80 

77 


t 

s 

69 

U 

36 

r a 

V 

, 10 

T 

53 

P 

'Jo 

F 

' 8 

C 

52 

M 

29 

Y 

7 

D 

45 

G 

l4 

H 

6 

L - 

36 

B 

13 

Q 

6 


J 

Z 

X 

W 

K 


3 

3 

2 

1 


1,000 


1-d. Percentage of vowels, high-frequency consonants, medium-frequency 
consonants, and low-frequency consonants in 60,113 letters pf Spanish 
plain text. Percentage of 8 most frequent letters in Spanish plain text , 

* £ rr t 4 C 

Vowels A,E,I,0,U, and Y = 46.3# 

High-Frequency Consonants U,R, and S = 22.6# 

Medium-Frequency Consonants C,D,L,M,P, and T = 24.5# 

Low-Frequency Consonants B,F,G,H,J,K,Q,V,W,X, and Z ■ 6,6# 

(in descending order of frequency) 


»- 


I V 


8 most frequent letters, E,A,0,I,N,R,S, and T •» 69.9# 

1 

1-e. Absolute frequencies of single letters as initial letters of 10,129 
words m Spanish plain text, arranged according to their frequencies. (One * 
letter words have been omitted) 7 


P 

1,128 

L 

435 

Q 

286 

V 

183 

Y 

27 

C 

l, 08 l 

R 

425 

I 

281 

F 

177 

W 

19 

D 

1,012 

M 

403 

H 

230 

0 

169 

Z 

2 

E 

989 

N 

’ 346 

U 

219 

B 

124 

K 

1 

S 

789 

T 

298 

G 

206 

J 

47 

X 

- 

A 

761 










10,129 


Includes N throughout all tables. 

From foreign words appearing in Spanish plain text. 
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2-b. Digraphic kappa plain, Sparish language « '.0091 


2-c. The 

8 ? digraphs 

comprising of (Spanish plain 

text 

, based on the 

table of -5 

,000 digraphs, (Item 2-e.) 

, arranged 

according to 

their relative 

frequencies. 



4 




* 

f * . 


i. 


EE 

12 6 

TE 

67 

IN 50 

NA 

4l 

MA 

32 

IS 

27 

EA 

20 

E3 

119 

/II 


EC 47 

in 

4o 0 

SA 

32 

EH 

26 

OA 

19 

ON 

104 

1,28 r 

PI 45 


PO 

31 

SP 

26 

FU 

19 

EH 

94 

AD 

a 

EL 44 

CA 

39 

m 

30 

ED 

26 

SC 

18 

EE 

94 

AS 

62 

LA 44 

KD 

37 

PA 

30 

OD 

26 

AT 

18 

ET 

91 

TA 

60 

RO 43 

TI '35 

AD 

30 

AP 

24 

CU 

18 

DE 

84 

DO 

59 

NO 43 

IJ2 

35 

DI 

30 

IT 

24 

EE 

17 

AP 

8] 

OR 

58 

IA 43 

TR 

34 

ID 

29 

EP 

23 

OB 

17 

Cl 

80 

faE 

5r 

IC 42 

UN 

34 

CU 

29 

SU 

23 

CE 

17 

RA 

74 

CT 

57 

IE 42 

PR 

34 

OP 

29 

SO 

22 

ET 

17 

03 

73 

TO 

5 c 

AL 4l 

OM 

33 

LI 

28 

OL 

22 

LO 

17 

CO 

69 

AC 

54 

SI 4l 

NC 

33 

NI 

28 

1.3 

21 



10 

67 

UE '52 

BE 4l 

j 

DA 

32 

OC 

28 

EG 

21 

3,753 

2-d. Frequent 

digraphs JLn Spanish plain tejt whose 

reversals ore also 

frequent. 

accompanied by their 

frequencies 

from the 

table 

of 5,000 digraphs 

Kltem 2-a) 













- t 


- -* J 

v * 

** 

t 

/- 

■Jt 







EN 126 

i HE 

4l 

AR 

8l J RA 74 

“as 

62 

J SA 32 

LA 44 J AL 4l 

ES 119 

! SE 

57 

Cl 

80 1 IC 

42 

OR 

58 

i RO 43 

EL 44 J LE 35 

ON 104 

! NO 

43 

AIT 64 J HA 

41 

AC 

54 

! CA 39 

MA 32 1 AM 30 

ER 94 

J RE 

94 

AD 64 | DA 32 







- 

2-e . Frequent digraphs* l4 Spanish'plain text 

whose 

reversals are rare or 

infrequent 

, accompanied by their frequencies from the 

table of 5,000 

digrapho (Item 

2-a) 

« 


1 

» 


t 




*. 

NT 91 

' TN 0 

ST 

57 | TS 

0 

ND 37 

! DN l| 

EC 

33 | CN 0 

10 67 

! <5i 



> * 

1 T 

» r ' 






</< 1 

2-f. Doublets 

occurring in Spanish plain text, arranged according to their 

frequencies from the table of 

5,000 

digraphs (Item 

2-8 

2. 


EE If | AA 12 

RR 10 | SS 

10 | LL 9 | CC 

>1 

00 4 

NN 3 | 

DD 2 

2-g. The 

21 digraphs 

appearing 100 or more times as beginnings of words 


In 30,329 words in. Spanish plain text, arranged according to their absolute 
frequencies 


CO 684 
EE 335 
DE 323 


PR 307 
ES 286 
QU 286 


PA 263 
PO 247 
IN 235 


SE 189 
DI 175 
pa 157 


CA 151 
SI 137 
MI 117 


PE 111 
UN 109 
HA 108 


MA 101 
CU 100 
SO 100 


1 

The 15 digraphs above this line comprise 25> of Spanish plain text # 
^ The 40 digraphs above this line comprise 50$ of Spanish plain text. 
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The 10? trlgraphs appearing 100 or more times in 60,115 letters of 
Spanish plain text, arranged according to their absolute frequencies. 


ENT 596 ARA 229 POR 176 OSE 

ION 564 ONE 227 TER 174 ONS 

CIO 502 ESE 217 ODE 168 REC 

NTE 429 ADE 202 ERE 366 ORE 

CON 415 PAR 193 ERA 165 0C0 

EST 355 CIA 190 TRA 165 DDE 

EES 335 ENC 190 AMS 163 ICI 

ADO 30 7 NCI 188 ERI 162 END 

QUE 294 PRE 184 MER 159 SEN 

ACI 277 DEL 183 ELA 158 TAD 

NTO 270 NDO 183 PRO 155 ECO 

IEN 267 NES 183 ACO 153 STR 

COM 246 DOS 182 ENE 151 TOS 

ICA 242 MEN 181 UES 150 IDA 

STA 240 HTA 176 ESP 149 SDE 


147 ERO 131 NDE 121 PER 111 

144 ORT 131 RAN 121 ASE 109 

344 ANA 130 STB 119 CAN 109 

143 ARE 130 REIT 118 UNI 108 

142 UNT 129 ARI 117 OSI 107 

l4l ANO 127 TEN 116 GEN 105 

l40 TAR 127 01© 115 NCO 105 

139 ANT 126 RIA 115 RIO 105 

139 ESA 126 EC I ll4 ERN 104 

138 IER 126 1ST 113 OMI 104 

135 ADA 125 ONA 113 SCO 104 

134 DEN 124 DAD 112 TES 103 

133 AND 123 INT 112 BIE 101 

132 DES 121 NTR 112 NTI 100 

132 IDO 121 ESI 111 TOR 100 


3-"b. The 19 trigraphs appearing 50 or more times as beginnings of words 
in 10,129 words in Spanish plain text, arranged according to their 
absolute frequencies. 


CON 

298 

PAR 

154 

PUN 

93 

INT 

72 

UNI 

55 

CUA 

52 

REP 

51 

COM 

218 

PRO 

139 

PER 

80 

RES 

72 

DES 

53 

TRA 

52 

AEG 

50 

EST 

194 

PRE 

114 

I GOB 

77 

NUE 

66 

INF 

53 






4. The 86 tetragraphs appearinf; 50 or more times In 60,115 letters of 
Spanish plain text, arranged according to their absolute frequencies. 


CION 444 J CONS 104 I ERNO 79 

AGIO 252 CONT 99 IERN 78 

ENTE 233 PUNT 95 OQUE 78 

ESTA 174 ANDO 91 IONA 77 

I0NE 159 TADO 91 UEST 77 

MEET 150 AGON 90 BIER 76 

ONES 146 ANTE 89 ICAN 76 

IENT l4l NTER 85 RESE 76 

EI1T0 137 INTE 84 GOBI 75 

ENC I 128 NTES 82 OBIE 75 

PARA 117 ADOS 8l ECON 74 

EHTA 115 AMEN 8l RGEN 73 

NCIA 115 OCON 8l RICA 73 

PRES 111 ESEN 80 STAD 73 

UNTO 111 ONDE 80 


AMER 72 f FORM 62 EEST 55 

IEND 72 SENT 62 SCON 55 

3DAD 71 ICIO 6l SIDE 55 

ENDO 70 ONTR 60 CIEN 54 

ERIC 70 SION 60 NFOR 54 

NTOS 70 CCIO 59 OPOR 54 

MIEN 69 GENT' 58 RESP 54 

IOND 67 COMA 57 ARIO 53 

MERI 67 ESDE 57 ESTR 53 

NTRA 67 ORES 57 ARGE 51 

DELA 65 REC I 57 ECTO 51 

ENTI 64 AQUE 56 PART 51 

NTIN 64 IONP 56 POSI 51 

COMI 63 QUES 56 EPRE 50 
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E. PORTUGUESE T JITTER FREQUENCY DATA 


1-a. Absolute frequencies of single letters of Portuguese plain text, arranged 
o 7 phabetically, Ij'ascd on 45 >106 3 eVfccra of text . 


A 

5,362 

G 

724 

L 

1,245 

Q 

348 

V 

737 

B 

470 

H 

304 

M 

1,699 

R 

3,292 

w 

24 

C 

2,285 

I 

3,31 } » 

N 

2,912 

S 

3,409 

X 

166 

D 

1,900 

J 

160 

0 

5,001 

T 

2,679 

Y 

22 

E 

,5,441 

IC 

17 

P 

1,377 

U 

1,491 

z 

207 

F 

520 



- 









1-h. Monographic kappa plain, Portuguese language = .0746 

l~c. Frequency distribution of single letters based on 45,306 letters of 
Portuguese plain text, reduced to 1,000 letters . 


E 

121 

N 

65 

U 

33 

F 

ll" 

X 

4 

A 

119 

T 

59 

P 

30 

B 

10 

J 

3 

0 

111 

C 

51 

L 

28 

Q 

8 

w 

X 

S 

76 

D 

42 

V 

16 

H 

7 

Y 

- 

I 

73 

M 

38 

G 

16 

Z 

5 

K 

M 

R 

73 






- 

- 



1,000 


1-d. Percentage of vowels , high-frequency consonants, medium-frequency con ¬ 
sonants, and, low frequency^consonants in 45,106 letters of Portuguese plain 
text. Percentage of 8 most Pi oquest letters in Portugese plain text . 

Vowels A,E,I,0,U, and Y = 45.8# 

High-Frequency Consonants N,R, and S * 21.3# 

Medium-Frequency Consonants C,D,L,M,P, and T » 24.8# 

Low-Frequency Consonants B,F,G,H,J,K,Q,V,W,X,Y, and Z * 8.1# 


(in descending order of frequency) 


8 most frequent letters E,A,0,S,I,R,N, and T - 69 . 7 # 


l-e. Absolute frequencies of single letters as initial letters of 7.058 words 
in Portuguese plain text, arranged according to their frequencies . 


p 

847 

M 

405 

I 

264 

B 

313 

c 

731 

T 

348 

F 

222 

G 

111 

E 

608 

R 

316 

Q 

222 

J 

92 

S 

601 

IT 

299 

0 

187 

U 

77 

A 

D 

597 

506 

V 

271 

L 

143 

H 

60 


***>&■- , 
14 
11 


7 

4 , 

2 ,, 

7758 

* 
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2 -b. Digraphic kappa plain, Portuguese language * .0084 

i 

2 -c. The 93. digraphs conqurislng 75# of Portuguese plain text, “baaed on the 
table of 5.000 digraphs (Item 2-a), ar^ahged according to their relative 
frequencies . 


NT 

114 

TA 


ST 

47 

AM 

, 36 

CE 

30 

OD 

25 

AT 

22 

EN 

97 

lj 

iSsP- 

RI 

46 

2 ) 

,505 a 

NC 

29 

NO 

25 

UA 

22 

ES 

95 

SE 

62 

DA 

45 

HD 

35 

PR 

28 

LA 

24 

OA 

21 

TO 

88 

DO 

61 

m 

44 

OP 

35 

3T 

27 

LE 

24 

LI 

21 

RE 

86 

BE 

61 

IA 

42 

SP 

35 

QE 

27 

AP 

23 

OL 

20 

CO 

85 

AD 

60 

MA 

4i 

RO 

34 

El 

27 

EG 

23 

ET 

20 

OS 

85 

FO 

60 

SA 

4i 

IC 

34 

UE 

26 

VI 

23 

01 

20 

ON 

79 

CA 

60 

SS 

4o 

TR 

33 

MI 

26 

SO 

23 

NS 

19 

ER 

76 

AH 

56 

Cl 

39 

DI 

33 

10 

26 

SI 

23 

SU 

18 

- RA 

75 

IN 

53 

IS 

39 

OC 

32 

PA 

26 

OV 

22 

RT 

18 

AS 

72 

AC 

52 

AL 

38 

EL 

31 

TI 

26 

SC 

22 

EP 

18 

OR 

71 

ME 

51 

VE 

37 

ID 

31 

PE 

25 

IM 

22 

UI 

18 

TE 

69 

AO 

49 

qu 

37 

NA 

31 

IR 

25 

ED 

22 

3j 

,755 

AR 

68 

EC 

48 

OM 

36 





« 





2 -d. Frequent digraphs In Portugese plain text whose reversals are also 
frequent, accompanied by their frequencies from the table of 5.000 digraphs 

Xltem"2-a7'~. 


ES 

95! 

SE 

62 

OR 

7l! 

RO 

34 

ME 

51! 

EM 

44 

RE 

86 « 

ER 

76 

CA 

60 ! 

AC 

52 

EC 

48; 

CE 

30 

CO 

85! 

OC 

*32 

AD 

60 ■ 

DA 

45 

KA 

4l| 

AM 

36 

RA 

75 ! 

AR 

68 

PO 

60 ! 

OP 

35 

Cl 

39 

IC 

34 

AS 

72: 

SA 

4l 

AN 

56! 

NA 

31 

DI 

33 1 

ID 

31 


2-e. Frequent digraphs in Portuguese plain text whose reversals are rare 
or infrequent, accompanied by their frequencies from the table of 9,000 
digraphs (Iten 2-a)" « 

HT 114 j TN 1 i ST 4? | TS 0 | ND 35 i DN 0 

2-f. Doublets occurring in Portuguese plain text, arranged according to 
their frequencies from th'e table of 5>000 digraphs 7ltem2-a) . • 


SS 

40 

EE 

11 

00 

5 

LL " 

2 

AA 

11 

RR 

11 

CC 

2 

m 

2 


11 PP 11 TT 


1 


2-g. The 20 digraphs appearing 100 or more times as 'beginnings of words In 
6,803 -words in Portuguese plain text, arranged according to their absolute 
frequencies . 


CO 464 
PO 386 
se 333 


RE 276 
BE 259 
qjj 220 


IN 188 
ES 173 
PR 169 


PA 143 
NA 133 
TE 132 


HA 130 
PE 122 
VE 115 


ME 111 
MI 105 
NO 104 


TR 103 
DI 102 


1 

2 


The 15 digraphs above this line compose 25# of Portuguese plain text 
The 42 digraphs above this line compose 50# of Portuguese plain text 
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3-a. The 59 trlgraphs appearing 100 or more times In 45,106 letters of 
Portuguese plain text, arranged according to their absolute frequencies . 


ENT 

474 

TOS 

191 

ERE 

150 

IDA 

133 

OSE 

126 

ECE 

115 

ASE 

105 

NTO 

457 

EST 

186 

CIA 

145 

TER 

132 

ARE 

125 

NCI 

n4 

ITO 

104 

OUT 

303 

ACA 

182 

ADE 

143 

OPO 

130 

ESE 

124 

REC 

113 

ELE 

103 

DTE 

284 

RES 

181 

STA 

143 

SPO 

130 

OVE 

124 

PAR 

112 

ERI 

103 

CON 

255 

QUE 

172 

ICA 

142 

ADA 

129 

SSA 

124 

ESS 

110 

PRO 

102 

PON 

236 

NTA 

167 

OCO 

140 

TRA 

129 

DES 

123 

DAD 

109 

AME 

101 

CAO 

227 

POR 

159 

ARA 

136 

NDO 

127 

ECO 

121 

ORE 

108 

OSS 

101 

ADO 

‘211 

AGO 

158 

DOS 

134 

ENC 

126 

ODE 

118 

EDI 

107 

IME 

100 

MEN 

205 

COM 

154 

OES 

134 










3-D. The 19 trlgraphs appearing 50 or more times as beginnings of vords 
In 6,497 vords in Portuguese plain text, arranged according to their 
absolute frequencies . 


CON 224 
PON 213 
COM 136 


QUE 109 
EST 105 
PAR 93 


PRO 

93 

QUA 

83 

TRA 

66 

VEX 

531 

! RES 

52 

POR 

88 

DES 

71 

MIL 

61 

IND 

52 | 

! REC 

51 

NAO 

86 

SER 

70 

REF 

56 






4. The 38 tetragraphs appearing 50 or more times In 45,106 letters of 
Portuguese plain text, arranged according to their absolute frequencies * 


ONTO 233 

ENTA 

97 

AMEN 

81 

CONT 

68 

CONS 

58 

PONT 221 

NCIA 

95 

PARA 

81 

FORM 

67 

NTES 

58 

24ENI 183 

PORT 

87 

COES 

73 

OCON 

66 

ANDO 

57 

EIITO 173 

DADE 

86 

IDAD 

71 

ELEG 

61 

ANTE 

57 

EHTE 147 

ESTA 

85 

CENT 

70 

ADOS 

60 

ORMA 

54 

ACAO 142 

ENCI 

83 

INTE 

70 

IMEN 

60 

VEXA 

54 

NTOS l4l 

SPON 

83 








RENT 52 
TELE 52 
EGRA 51 
NFOR 51 
OPON 51 
LEGR 50 


5# Average length of voids in Portuguese plain text = 6.48 
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F* RUSSIAN LETTER FREQUENCY DATA 


5,122 

1,095 

3,51*3 

1 , 11*1 

2,076 

5,537 

502 


base 

cTFn'577S5’Cr: 

letters 

of text. 


3 

1,280 

H 

U,h63 

y 

1,578 

Wi 

M 

1*,923 

O 

8,078 

+ 

127 

bl 

W 

14 

961 

n 

1,815 

X 

91*1 

b 

K 

2,32k 

P 

3,1*2? 

u. 

369 

* 

J\ 

2,7l*7 

c 

3,917 


902 

to 

M 

1,936 

T 

l*,ol*i 

Ui 

551* 

$1 


text, arranged 


257 

1,I{21 

960 

173 

1*55 

1,185 

57^53 



mage s ,0568 




HP 


SaasagaKa 


o 

119 

B 

52 

n 

27 

G 

16 

*• 

E 

, 82 

P 

50 

y 

23 

PI 

Hi 

K> 

A 

75 

J\ 

1*0 

bi 

21 

b 

H* 

U, 

*4 

73 

K 

31* 

3 

19 

X 

H* 

W, 

H 

66 

A 

31 

9i 

17 

M 

13 

» 

T 

60 

M 

29 

r 

17 

uu 

8 

4 


%?HS_ 


1-d* Percentage of vowols, high-fr e quency conso nants, med i um- fr equency co n 
sonants, anTTou^j^eqacncy consonants xn oT7S5^T!eIIers of"E,uss?an pTairntext 




Vowels A, E , 14 , ft , O, V, bl, } ,K>, and 9 a 1*3.1$ 

High-Frequency Consonants B, H, P, C, and T a 28.6$ 

Medium-Frequency Consonants B,r, I n,j^>K,Jl,M,n,X, t li and b s 25*1 
Low-Frequency Consonants )&, ^ , LA , ui, and ui, a 2 ,6% 

10 most frequent letters (in descending ordej* of frequency) O, E, A, 
W, H,T, C, B, P, andJl a 67.5# 

1-e# Absolute freque ncies of single letters as initial letters of 10,601 

wordsjK^^sj^n^^MSr^^^ J( _arra : ngeT^cc^^[^^^3H^^^Sin^!^S* 

'(One-letter words ha!ve been omitted.) 


n 

1,210 

A 

1*96 


321 

X 

120 

c 

983 

M 

1*1*6 

r 

292 

A 

116 

H 

800 

p 

1*29 

y 

229 

£ 

92 

B 

731 

T 

1*18 

H 

182 

¥ 

72 

O 

650 

3 

1*01* 

> 

11*7 

UI 

63 

K 

555 

s 

3UU 

J1 

11*6 




5-23 


TS^TSI 
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2-a* Frequency dis tribution of digraphs based on 67»05>Q loiters of 

to 5>.wD dig raphs, 

2.4. LeHer 

ABB r AE)|t3HftkJlMHon PC T V + XU.MUUHW b } IO 51 

A [TTl2 f 35 I 8 f 11 rTTc fj51 7l 71191 2711 9 h 5 p". 1 11 Tvfi 131 127 T 3 T 1110 j«]" 7 f 1 ol 3 f~ f Islof 9 
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2-b. Digraphic kappa plain, Russian language s .005? 

2-c. The 159 digraphs comprising 75$ of Russian plain text, baaed on the 
table of 5,000 digraphs (Item 2-a ) , arranged according ttTtheir relative 
frequencies. ”” 


0B 

84 

UP 

39 

J 10 

30 

EM 

24 

AM 

19 

AH 

14 

HP 

11 

bh 

10 

CT 

82 

OM 

38 

Jib 

30 

PH 

24 

OK 

19 

IBM 

14 

CC 

11 

aid 

10 

HO 

67 

FE 

37, 

he 

29 

IHI 

24 

TO 

18 

yc 

14 

IDE 

11 

jin 

9 

KO 

66 

izssr 

MT 

29 

M3 

23 

Bbl 

18 

bH 

13 

An 

11 

PT 

9 

EH 

63 

EC 

37 

oft 

29 

f TE 

23_ 

03 

18 

CJI 

13 

HH 

11 

HB 

9 

m 

58 

JIM 

36 

MC 

29 


MA 

18 

HH 

13 

TH 

11 

BH 

9 

BO 

58 

Hbl 

36 

T H 

28 

m 

23 

XO 

18 

m 

13 

Mft 

11 

HC 

9 

PO 

56 

BA 

35 

OB 

28 

HO 

22 

on 

18 

EK 

13 

yH 

11 

EE 

9 

TO 

56 

3A 

35 

AT 

27 

MB 

22 

EB 

18 

HH 

13 

sr 

11 

3H 

9 

PA 

55 

EJI 

35 

TB 

27 

HJI 

21 

CH 

17 

HH 

13 

MH 

10 

EE 

9 

HA 

54 

AB 

35 

EH 

27 

Tb 

21 

MX 

17 

OM 

12 

E3 

10 

My 

9 

ro 

50 

TA 

35 

AJI 

27 

ME 

21 

m 

17 

>KE 

12 

ftc 

10 

HB 

9 

oc 

50 

HE 

34 

CO 

27 

ME 

21 

BM 

17 

AE 

12 

nn 

10 

HT 

9 

AH 

48 

ET 

33 

KW 

26 

EO 

21 

py 

16 

HA 

12 

AX 

10 

3T 

9 

Ofl 

47 

JIE 

33 

AP 

26 

m 

20 

En 

16 

HE 

12 

m 

10 

AH 

9 

no 

46 

or 

32 

TP 

26 

BC 

19 

UX 

16 

OH 

12 

HT 

10 

CH 

9 

OP 

43 

BjE 

32 

HA 

25 

m 

19 

nE 

15 

TH 

11 

i<y 

10 

yio 

9 

np 

41 

m 

32 

CE 

25 

MO 

19 

A3 

15 

cn 

11 

fly 

10 

30 

9 

OJI 

41 

TE 

31 

JIA 

25 

AK 

19 

E ft 

15 

bC 

11 

KP 

10 

00 9 

CK 

40 

AC 

31 

KA 

24 

m 

19 

OE 

15 

EH 

11 

Ty 

10 

375o 

OT 

39 

OH 

30 














2-d. Frequent digraphs in Russian plain text whose loversala are al so 
frequent, accompanied by their frequencies from the table of 5,000 digraphs 
(Item 2-a) 


OB 

84 

BO 

58 

PO 

56 

OP 

43 

ro 

50 j 

or 

32 

BA 

35 

AB 

35 

HO 

67 

| OH 

30 

TO 

56 

OT 

39 

OJI 

41 i 

JIO 

30 

EJI 

35 

JIE 

33 

EH 

63 

| HE 

34 

HA 

54 

! AH 

45 

EP 

39 I 

PE 

37 

ET 

33 

! TE 

31 

HM 

58 

| HH 

32 











2-e. Frequent digraphs in Russian plain text whose reversals are rare 
or infrequent, accompanied by their frequencies from the table of 5,000 
digraphs (Item 2-a)* . 

np 4i j pn 2 | CK 40 j KC 3 


2-f. Doublets occurring in Russian plain text, arranged according to their 
frequencies from the table of 5>000 digraphs (Item 2-a) . 


HH 

24 

CC 

11 

EE 

7 

MM 

3 

| JIJI 

2 I 

HH 

-1 

m 

19 

00 

9 

BB 

5 

AA 

2 

1 TT 

2 1 

KK 

1 


1 j HH 


1 


1 The 24 digraphs above this line compose 25$ of Russian p’ain text. 

2 The 66 digraphs above this line compose 50$ of Russian plain text. 
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‘ l * 

2-g, The g4 digraphs appearing 100 or more times as beginnings of words in 
10,601 vords in Russian plain text, arranged according to their absolute 
frequencies . * ' 1 


np 

470 

PA 

2 B 0 

ro 

I 69 

OE 

146 

HO 

120 

KA 

110 

ME 

107 

no 

405 

HA 

246 

CE 

167 

HE 

137 

OT 

115 

HE 

no 

BC 

101 

3A 

292 

CO 

220 

CT 

l 6 l 

HE 

122 

3T 

111 

TO 

108 

HA 

101 

KO 

287 

BO 

179 

BH 

159 










3-a. The 69 trigraphs appearing 100 or more tire 3 In 67,850 letters of 
Russian plain* text, arranged, according to their absolute fioquencies 


oro 318 
EHH 2Q5 
CKO 270 
CTB 267 
OCT 260 
npo 233 
CTA 217 
0B0 264 
BOH 203 
EHH 198 


TEJI 188 
HOB 181 

EJH> 176 
OBA 169 
opo 167 

CTP I 65 
ECT 159 
AHM 158 
CKH 158 
TOB 158 


TOP 152 
JIBH 151 
noji 149 
JEEH 146 
HHX 145 
HUE 143 
HHH 143 
KOM 139 
HTE 138 
HOC 138 


ITPH 137 
PEH 137 
ETC 135 
HHH 135 
OBE 134 
KOB 130 
HHO 130 
COB 130 

upe 129 
HOr 128 


POfl 128 
KOr 123 
ABO 119 
HEP 119 
TBE 119 
SAB 118 
BAH 117 
KOH 117 
HO T A 117 
EPE 116 


POB 116 

CTH 115 
iimi 113 

ACT 112 
AHA 111 
HHE 110 
OJIb 110 

noc 110 
cto no 
Ero io4 


HES 104 
HKA 103 
TBO 103 
AEO 101 
HCT 101 
TPA 101 
BET 100 
OBC 100 
PA3 100 


3-h. The 20 trigraphs appearing 50 or core tiires as beginnings of vords 
in 10,601 vords in Russian plain text, arranged according to their 
absolute frequencies 


npo 

205 

ITPH 

95 

noc 

81 | 

BHH 

73 j 

non 

61 

CTA 

59 

toh 

51 

nPE 

116 

COB 

87 

nEP 

78 

PAE 

72 

EOJI 

60 

PA3 

53 

TOP 

50 

3AB 

108 

KOJI 

84 

non 

74 

HAP 

71 

PAfi 

60 

KOH 

52 




4. The 58 tetragraphs appearing $0 or more tiipes in 67,850 letters of 
Russian plain text, arranged, according to their absolute frequencies 



Horo 114 

TEJIB 111 
HTEJI 107 
Koro 99 
HOCT 98 
EHMH 97 
EHHO 95 
EHME 88 
nPEfl 87 
PABO 87 


COBE 

87 

ABOH 

86 

3AB0 

85 

CTBE 

84 

JTbHO 

83 

nOJIH 

82 

CKor 

82 

EHHH 

81 

AEOT 

80 


EJIBH 78 
CTBO 78 
MHEC 76 
OBET 74 
HECK 74 
AHOB 72 
OCTA 70 
CTPO 70 
BEHH 69 
TBEH 69 


BJIEH 68 
CKOft 66 
CTAB 66 

APOH 65 
EJILC 65 
ECTB 64 
CTAH 64 
HAPO 63 
OBAH 62 
CT?A 61 


npow 

60 

P0H3 

60 

OTOB 

59 

BETC 

56 

TOPO 

56 

ATEJI 

55 

toto 

55 

ETCH 

55 

CTBA 

55 


0M3B 54 
KOTO 53 
HKHX 53 
BOJCC 52 
ETCK 52 
OTOP 51 
BCKO *50 

ojko 50 

CKtfft 50 


JTEHH 80 

5- Ay^a^e_iength-0.£-VQr da_. in_Rus-9_ia.n -'Plal n._text = 6.4 
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